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PREFACE 



The longer an engineer has been separated from his alma mater, the 
fewer mathematical formulas he uses and the more he reUes upon tables 
and, when the latter fail, upon graphical methods. Although graphical 
methods have the advantage of being ocular, they frequently sirffer from 
the fact that only what is seen is sensed. But this defect is due to the 
kind of graphics used. With the aid of the scientific art of graphing pre- 
sented in Chapter I, one may not merely make better graphs in less time 
but actually draw correct negative conclusions from a graph so made, 
and therefore sense more than one sees. For instance, one may be sure 
that a given cubic equation has only the one real root seen in the graph, 
if the bend points lie on opposite sides of the x-axis. 

Emphasis is here placed upon Newton's method of solving numerical 
equations, both from the graphical and the numerical standpoint. One 
of several advantages (well recognized in Europe) of Newton's method over 
Homer's is that it applies as well to non-algebraic as to algebraic equations. 

In this elementary book, the author has of course omitted the dLEcult 
Galois theory of algebraic equations (certain texts on which are very 
erroneous) and has merely illustrated the subject of invariants by a few 
examples. 

It is surprising that the theorems of Descartes, Budan, and Sturm, on 
the real roots of an equation, are often stated inaccurately. Nor are the 
texts in English on this subject more fortunate on the score of correct 
proofs; for these reasons, care has been taken in selecting the books to 
which the reader is referred in the present text. 

The material is here so arranged that, before an important general 
theorem is stated, the reader has had concrete illustrations and often also 
special cases. The exercises are so placed that a reasonably elegant and 
brief solution may be expected, without resort to tedious multiplications 
and similar manual labor. Very few of the five hundred exercises are of 
the same nature. 

Complex numbers are introduced in a logical and satisfying manner. 
The treatment of roots of unity is concrete, in contrast to the usual ab- 
stract method. 

Attention is paid to scientific computation, both as to control of the 
limit of error and as to securing maximum accuracy with minimum labor. 

An easy introduction to determinants and their appUcation to the solu- 
tion of systems of linear equations is afforded by Chapter XI, which is 
independent of the earlier chapters. 

« • • 
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IV PREFACE 

Here and there are given brief, but clear, outlooks upon various topics 
of decided intrinsic and historical interest, — thus putting real meat upon 
the dry bones of the subject. 

To provide for a very brief coxurse, certain sections, aggregating over 
fifty pages, are marked by a dagger for omission. However, in compensa- 
tion for the somewhat more advanced character of these sections, they are 
treated in greater detail. 

In addition to the large number of illustrative problems solved in the 
text, there are five hundred very carefully selected and graded exercises, 
distributed into seventy sets. As only sixty of these exercises (falling into 
seventeen sets) are marked with a dagger, there remains an ample number 
of exercises for the briefer course. 

The author is greatly indebted to his colleagues Professors A. C. Lunn 
and E. J. Wilczynski for most valuable suggestions made after reading 
the initial manuscript of the book. Useful advice was given by Professor 
G. A. Miller, who read part of the galley proofs. A most thorough read- 
ing of both the galley and page proofs was very generously made by 
Dr. A. J. Kempner, whose scientific comments and very practical sugges- 
tions have led to a marked improvement of the book. Moreover, the 
galleys were read critically by Professor D. R. Curtiss, who gave the author 
the benefit not merely of his wide knowledge of the subject but also of his 
keen critical ability. The author sends forth the book thus emended 
with less fear of future critics, and with the hope that it will prove as 
stimulating and useful as these five friends have been generous of their 
aid. 

Chicago, February, 1914. 
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CHAPTER I 
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ERRATA 



Page iii, line 11. The words "opposite sides" should read "the same 
side." 

24, fifth line from bottom. "2 0**" should read "240**." 

27, line 3. " Ang e " should read " angle." 

27, Fig. 14. The letter r should appear at second point of division. 

39, equation (8). Blurred letters should read " a;^r4." 
100, equation (10). " 4 sL " should read " 4 sUr 
129, equation (10). " - b,B^ " should read " - 6,B,." 
179, line 2. " 2, 2, 3" should read " 2, 2, - 3." 
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y which a point determines in this manner are 
called its coordinates. Conversely, any pair of 
real numbers determines a point. 

Figure 1 shows the points which represent 
various pairs of values of x and y, satisfying 
the equation 



(1) 



2/ = x^ — 6 X — 3. 




(«r8) 



(5,-8) 



(4,-U) 



For example, the point P represents the pair 
of values x = 4, t/=— 11, and is designated 
(4, —11). Since the value of x may be as- 
signed at pleasure and a corresponding value of y is determined by 

equation (1), there is an infinitude of points representing pairs of values 

1 



2 THEORY OF EQUATIONS ICn 1 

satisfying the equation. These points constitute a curve called the graph 
of the equation. 

In Fig. 1, the curve intersects the x-axis in two points; the abscissa 
of one point of intersection is between 6 and 7, that of the other point is 
between — 1 and 0. The x-axis ia the graph of the equation y = 0. Thus 
the abscissas of the intersections of the graph of equation (1) and the 
graph of ^ = are the real roots of the quadratic equation 
(!') z=-6i-3 = 0. 

Hence to find graphically the real roots of the last equation, we equate 
the left member to y and use the graph of the resulting equation (1). 
For other methods, see §§ 16-18. 

EXERCISES 

1. Find graplucally the real roots otx^ — 6x + 7 = 0. 

2. Discuss graphically the reality of the roots of i' — 6 t + 12 = 0. 

3. Obttun the graph used in Ex. 1 by shifting the graph in Fig. 1 ten units 
upwards, leaving the axes OX and UY unchained. How 
may we obttun similarly that used in Ex. 2? 

4. Locate graphically the real roots ofi:' + 4z' — 7 — 0. 

2. Caution in Plotting. If the example set were 
(2) y = Sx*-Ux*-93?+nx-2, 

one might use successive integral values of x, obtain 
the points (-2, 180), (-1,0), (0,-2), (1,-0), 
(2, 0), (3, 220), all but the first and last of which are 
shown (by crosses) in Fig. 2, and be tempted to con- 
clude that the graph is a U-shaped curve approxi- 
mately like that in Fig. 1 and that there are just two 
real roots, — 1 and 2, of 
(2') 8;H- Ui^-ga:* -Hill- 2 = 0. 

But both of these conclusions would be false. In 

fact, the graph is a W-shaped curve (Fig. 2) and the 

additional real roots are J and i. 
This example shows that it is often necessary to 

employ also values of x which are not integers. The 
purpose of the example was, however, not to point out this obvious fact, 
but rather to emphasize the chance of serious error in sketching a curve 




Fig. 2 
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THE GRAPH OF AN EQUATION 



through a number of points, however numerous. The true curve between 
two points below the x-axis may not cross the x-axis, or may have a 
peak actually crossing the x-axis twice, or may be an M-shaped curve 
crossing it four times, etc. 



For example, the graph (Fig. 3) of 



(3) 



2/ = x^ + 4x* — 11 



crosses the x-axis only once. But this fact can not be concluded from 
a graph located by a number of points, how- 
ever numerous, whose abscissas are chosen at 
random. 

We shall find that correct conclusions re- 
garding the number of real roots can be de- 
duced from a graph whose bend points (§3) 
have been located. 

We shall be concerned with equations of the 
form 



OoX** + ttiX**-^ + 



. • . 



+ ttn-lX + On = 

(ao 5^ 0), 



in which Oo, ai, . . . an are real constants. 
The left member is called a polynomial in x of 
degree n, or also a rational integral function of x, 
and will frequently be denoted for brevity by 
the symbol /(x) and less often by /. 




Fig. 3 



3. Bend Points. A point (like M or Af' in Fig. 3) is called a bend 
point of the graph of 2/ = /(x) if the tangent to the graph at that point 
is horizontal and if all of the adjacent points of the graph lie below the 
tangent or all above the tangent. The first, but not the second, condi- 
tion is satisfied by the point of the graph oi y = x^ given in Fig. 4 
(see § 6). In the language of the calculus, /(x) has a (relative) maximum 
or minimum value at the abscissa of a bend point on the graph of y = 

fix). 

Let P = (x, y) and Q = (x + A, Y) be two points on the graph, 
sketched in Fig. 5, of y = /(x). By the slope of a straight line is meant 
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the tangent of the angle between the line and the x-axis measured counter- 
clockwise from the latter. In Fig. 5, the slope of the straight line PQ is 

Y-y _ f(x + h)--f(x) 
h h 



(4) 
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Fig. 4 
For equation (3), /(x) = x* + 4x* — 11. Hence 



Fig. 6 



/(x + A) = (x + A)» + 4 (x + hy - 11 

= x3 + 4x2- 11 + {Sx^ + 8x)h + {Sx + 4:)h' + hK 

The slope (4) of the secant PQ is here 

3x« + 8x + (3x + 4)A + A2. 

Now let the point Q move along the graph towards P. Then h approaches 
the value zero and the secant PQ approaches the tangent at P. The 
slope of the tangent at P is therefore the corresponding limit 3 x* + 8 x 
of the preceding expression. 

In particular, if P is a bend point the slope of the tangent at P is zero 
and hence x = or x = — §. Equation (3) gives the corresponding 
values of y. The resulting points 

M = (0,-11), M' = (-5, -H) 
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are easily shown to be bend points. Indeed, for a; > and for x between 
—4 and 0, x* (x + 4) is positive, and hence /(x) > — 11 for such values of 
X, so that the function (3) has a relative minimum at x = 0. Similarly, 
there is a relative maximum at x = — |. We may also employ the general 
method of § 8 to show that M and Af ' are bend points. Since these bend 
points are both below the x-axis, we are now certain that the graph 
crosses the x-axis only once. 

The use of the bend points insures greater accuracy to the graph than 
the use of dozens of points whose abscissas are taken at random. 

4. Derivatives. We shall now find the slope of the tangent to the 
graph of 2/ = /(x), where /(x) is any polynomial 

(5) /(x) = oox" + aix~-^ + • • • + a„_ix + a„. 

We need the expansion of /(x + ^i) in powers of x. By the binomial 
theorem, 

ao(x + hY = Oox" + nooX^-^A + Z — - aox"-^^* + . . . , 
ai(x + hy-^ = aix>>-^ + (n - l)aix"~^fe + ^"^ " "^^ " ^^ a^x-^h^ + • • • , 



a„-2(a: + Kf = a„-2a:2 + 2 an-^ixh + a„-2A^ 

On-l(x + A) = On-lX + On-l/l, 
On = On. 

The sum of the left members is evidently /(x + A). On the right, the 
sum of the first terms (i.e., those free of K) is /(x). The sum of the 
coefficients of h is denoted by /'(x), the sum of the coefficients of \ h^ is 
denoted by/"(x), • • • , the sum of the coefficients of 

ft* 
1-2 • . . A; 
is denoted by /(*> (x) . Thus 

(6) f\x) = noox^-i + (n - l)aiX»-2 + . . . + 2 an-2X + a^-i, 

(7) /"(a;) = n{n - 1) aoX»-2 + (^ - l)(n - 2)aiX»-» + . . . + 2 an-2, 
etc. Hence we have 

(8) /(x + ft) = /(x) + /'(x) ft + r(x) ^ 
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This formula (8) is known as Taylor^ a theorem for the present case of 
a polynomial f{x) of degree n. We call J\x) the {first) derivative of 
/(x), and /"(x) the second derivative of /(x), etc. Concerning the 
fact that J"{x) is the first derivative of /'(x) and that, in general, the 
fcth derivative /(*)(x) of /(x) equals the first derivative of /^*~^K^)» see 
Exs. 6-9 of the next set. 

In view of (8), the limit of (4) as h approaches zero is/'(x). Hence 
f\x) is the slope of the tangent to the graph of y = /(x) at the point (x, t/). 

In (5) and (6), let every a be zero except Oq. Thus the derivative of 
Oox" is naoX"~S and hence is obtained by multiplying the given term by 
its exponent n and then diminishing its exponent by unity. For example, 
the derivative of 2 x' is 6 x^. 

Moreover, the derivative of /(x) equals the sum of the derivatives of 
its separate terms. Thus the derivative of x* + 4 x^ — 11 is 3 x* + 8 x, 
as found also in § 3. 

6. Computation of Polynomials. The labor of computing the value 
of a polynomial /(x) for a given value of x may be much shorten:ed by 
a simple device. To find the value of 

x» + 3x2~2x-5 

for X = 2, we note that x* = x • x^ = 2 x*, so that the sum of the first two 
terms is 5 x^. This latter equals 5 • 2 x or 10 x, adding this to the next 
term — 2 x, we get 8 x or 16. The final result is therefore 11. 
Write the coefficients in a line. Then the work is: 

1 3-2 - 5 [2 

2 10 16 

15 8 11. 

In case not all the intermediate powers of x occur among the terms of 

/(x), the missing powers are considered as having the coefficients zero. 

Thus the value —61 of2x* — x^ + 2x— 1 for x= —2 is found as 

follows: 

2 0-1 2 -l|-2 

-4 8-14 28 -60 

2-4 7-14 30 -61. 

For another manner of presenting this method see Ch. X, § 4. 
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EXERCISES 



1. The slope of the taDgent to y == 8a^ — 22 x^ + ISz — 2 at (x, y) is 
24x2 - 44 a; + 13. The bend points are (0.37,0.203), (1.46, -5.03), approxi- 
mately. Draw the graph. 

2. The bend points of y = x^ - 2x - 5 are (.82, -6.09), (-.82, -3.91), 
appro: jmately. Draw the graph and locate the real roots. 

3. Find the bend points of2/ = x' + 6x2 + 8x-h8. Locate the real roots. 

4. Locate the real roots of fix) = x* + x^ — x — 2 = 0. The abscissas of 
the bend points are the roots of f (x) = 4x^ + 3x2 — 1 = 0. The bend points 
of 2/ = /'(x) are (0, —1) and ( — ^, — f), so that /'(x) = has a single real root 
(it is just less than |). The single bend point of y = f(x) is (J, — f i), approxi- 
mately. 

5. Locate the real roots of x* — 7 x* — 3 x^ + 7 = 0. 

6. f"{x)y given by (7), is the first derivative of /'(x). 

7. If /(x) = /i(x) +/2(x), the A:th derivative of / equals the sum of the A;th \^- 
derivatives of /i and/2. Use (8). I 

8. f^'^^ix) equals the first derivative of p^~^\x). Hint: prove this for / =ax*~; 
then prove that it is true for / = /i + /2 if true for /i and /2. 

9. Find the third derivative of x® + 5 x^ by forming successive first derivatives; 
also that of 2 x*^ — 7 x* + x. .^ 

10. The derivative of gk is g'k + gk\ Hint: multiply the members of ^(x + h) ^ ; ^ '^ 
^(x) + g'{x) A + • • • and k{x + A) = k{x) + k'{x) h+ - - - and use (8) for 
}=gk. 

6. Horizontal Tangents. If (x, y) is a bend point of the graph of 
y = /(^)> then, by definition, the slope of the tangent at (x, y) is zero. 
Hence (§4), the abscissa x is a root of f'(x) = 0. In Exs. 1-5 of the 
preceding set, it was true that, conversely, any real root of /'(x) = 
is the abscissa of a bend point. However, this is not always the case. 
We shall now consider in detail an example illustrating this fact. The 
example is the one merely mentioned in § 3 to indicate the need of the 
second requirement made in our definition of a bend point. 

The graph (Fig. 4) of 2/ = x' has no bend point since x^ increases when 
X increases. Nevertheless, the derivative 3 x^ of x' is zero for the real 
value X = 0. The tangent to the curve at (0, 0) is the horizontal line 
2/ = 0. It may be thought of as the limiting position of a secant through 
which meets the curve in two further points, seen to be equidistant 
from 0. When one, and hence also the other, of the latter points ap- 
proaches 0, the secant approaches the position of tangency. In this 
sense the tangent at is said to meet the curve in three coincident 
points, their abscissas being the three coinciding roots of x' = 0. In the 



8 THEORY OF EQUATIONS ICh. I 

usual technical language which we shall employ henceforth, x' = has 
the triple root x = 0. The subject of bend points, to which we recur in 
§ 8, has thus led us to a digression on the important subject of double 
roots, triple roots, etc. 

7. Multiple Roots. In (8) replace x by a and A by x — a. Then 

(9) fix) =Ka)+r{a){x - «)+/"(«) ^^=|^ +/'"(«) ^^+ .... 

Thus the constant remainder obtained by dividing any polynomial /(x) 
by X — a is /(a), a fact known as the Remainder Theorem. In par- 
ticular, if /(a) = 0, fix) has the factor x — a. This proves the Factor 
Theorem: If a is a root of /(x) = 0, then x —a is a factor of /(x). 

The converse is true: If x — a is a factor of /(x), then a is a root of 
fix) = 0. In case fix) has the factor (x — a)^, but not the factor 
(x — a)^, a is called a double root of fix) = 0. In general, if fix) has 
the factor (x — a)*~, but not the factor (x — a)'"''"^ a is called a multiple 
root of multiplicity m of fix) = 0, or an m-fold root. Thus, 4 is a simple 
root, 3 a double root and --2 a triple root of 

7(x-4)(x-3)2(x + 2)» = 0. 

This algebraic definition of a multiple root is in fact equivalent to the 
geometrical definition, given for a special case, in § 6. 

The second member of (9) is divisible by (x — a)^ if and only if /(a) = 0, 
/'(a) = 0, and is divisible by (x — a)' if and only if also /"(«) = 0, etc. 
Hence a is a double root of fix) = if and only if /(a) = 0, /'(a) = 0, 
/" (a) 7^ 0; a is a root of multiplicity m if and only if 

(10) fia) = 0, ria) = 0,/"(a) = 0, • • • , /(-^)(a) = 0, /(-)(a) 9^ 0. 

For example, zero is a triple root of x* + 2 x* = since the first and second 
derivatives are zero for x = 0, while the third derivative 24 x + 12 is not. 

If fix) and /'(a:) have the common factor (x — a)'""S but not (x — a)"*, 
where m = 2, then a is a root of fix) = of multiplicity m. For, a is 
a root of multiplicity at least m — 1 of both fix) = and /'(x) = 0, so 
that the equalities in (10) hold; also /<"•)(«) ^ holds, since otherwise a 
would be a root of both fix) = and /'(x) = of multiplicity m or greater, 
and (x — a)"* would be a common factor. Hence if fix) and fix) have a 
greatest common divisor g(x) involving x, a root of g{x) =0 of multiplicity 
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m — 1 is a root of f{x) = of multiplicity m, and conversely any root of 
f{x) = of multiplicity. m is a root of g{x)'= of multiplicity m — 1, The 
last fact follows from relations (10), which imply that a is a root of 
f\x) = of multiplicity m — 1, and hence that f{x) and f\x) have the 
common factor (x — a)*"'"^ but not (x — a)*^. 

In view of this theorem, the problem of finding all the multiple roots 
of /(x) = and the multiplicity of each multiple root is reduced to the 
problem of finding the roots of g(x) =0 and the multiplicity of each. 

For example, let f{x) = x^-'2x^ — 4x + 8. Then 

fix) = 3x2 - 4a; - 4, 9f{x) =/'(x) (3a; - 2)- 32 (x - 2). 

Since x — 2 is a factor of /'(x) it may be taken to be the greatest common divisor 
oif{x) and/'(a;), as the choice of the constant factor c in c{x — 2) is here immaterial. 
Hence 2 is a double root of f(x) = 0, while the remaining root —2 is a simple root. 

EXERCISES 

^1. 3^ — 7x^+15x — 9 = 0hBa& double root. 

^2. X* — 8x^ + 16 = has two double roots. 

^3. X* - 6x2 - 8x - 3 = has a triple root. 

\4. Test X* - 8 x» + 22 x2 - 24 X + 9 = for multiple roots. 

"" 5. Test x«-6x2-|-llx-6 = 0for multiple roots. 

8. Inflexion and Bend Points. The equation of the tangent to the 
graph of 2/ = /(x) at the point (a, fi) on it is 

y = r(a) {X'-a)+p [i3 = /(a)]. 

For the abscissas of its intersections with the graph of y = f(x), we have, 
from (9), 

/ (a) ^^^ +f («) 1.2.3 + ^• 

If a is a root of multiplicity m of this equation, the point (a, /3) is counted 
as m coincident points of intersection of the tangent and the curve (just 
as in the example in § 6). This will be the case if and only if * 

(11) /"(a) =0, /'"(a) = 0, . . . , /(-^)(a) = 0, /(-)(a) 7^ 0. 

For example, if /(x) = x* and a = 0, then m = 4. The graph of y = x* is a 
U-shaped curve, whose intersection with the tangent (x-axis) at (0, 0) is counted 
as four coincident points of intersection. 

* If m = 2, only the last relation of the set is retained: /"(a) ^ 0. 
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If m is even, the points of the curve in the vicinity of the point of 
tangency (a, j8) are all on the same side of the tangent and the point (a, /3) 
is, by the definition in § 3, a bend point. But if m is odd (m > 1), the 
curve crosses the tangent at the point of tangency (a, fi) and this point 
is called an inflexion point, and the tangent an inflexion tangent. To 
simplify the proof, take (a, /3) as the new origin of coordinates and the 
tangent as the new x-axis. Then the new equation of the curve is 

y = ex*" + cb^^ + • • • (c 7^ 0, m = 2). 

For X suflSciently small numerically, y has the same sign as ex*" (§ 11). 
Thus if m is even, the points of the curve in the vicinity of the origin are 
all on the same side of the x-axis. But if m is odd, the points with small 
positive abscissas lie on one side of the x-axis[and those with numerically 
small negative abscissas lie on the opposite side. 

For example, (0, 0) is a bend point of the graph of y = x*. But (0, 0) is an 
inflexion point of the graph (Fig. 4) of y = x*, and the inflexion tangent y = 
crosses the curve at (0, 0). Here /"(O) = 0, /'"(O) = 6, so that m = 3, in accord 
with the evident fact that x' = has the root zero of multiplicity 3. 

We have, therefore, in the evenness or oddness of m in (11) a practical 
test to decide which roots a of /'(x) = are abscissas of bend points 
and which are abscissas of inflexion points with horizontal inflexion 
tangents. 

EXERCISES 

1. If fix) = 3 x*^ + 5 x» + 4, the only real root of /'(x) = is x = 0. Show 
that (0, 4) is an inflexion point, and thus that there is no bend point and hence 
that fix) = has a single real root. 

^2. x* — 3x2 + 3x-|-c = has an inflexion point, but no bend point. 

"" 3. X* — 10 x* — 20 x^ — 15 X + c = has two bend points and no horizontal 
inflexion tangents. 

^4. 3 X* — 40 x^ + 240 x + c = has no bend point, but has two horizontal 
inflexion tangents. 

5. Any function x* — 3 ax* -+-••• of the third degree can be written in the 
form fix) = (x — a)' -{- ax -{-b. The straight line having the equation t/ = ax -f 6 
meets the graph of y = fix) in three coincident points with the abscissa a and 
hence is an inflexion tangent. If we take new axes of coordinates parallel to the 
old and intersecting at the new origin (a, 0), i.e., if we make the transformation 
X = X + a, 2/ = K, of coordinates, we see that the equation fix) = becomes a 
reduced cubic equation X' -f pX -i- q = (cf. Ch. III). 

6. Find the inflexion tangent to y = x'-f6x^ — 3x+l and transform 
x' + Gx* — 3x + l = into a reduced cubic equation. 
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9. Real Roots of a Cubic Equation. It suffices to consider 

f{x) =x' -3lx + q (i 5^ 0), 

in view of Ex. 5 above. Then /'= 3 (x^ - I), /"= 6 x. If J < 0, there 
is no bend point and the cubic equation f{x) = has a single real root. 
If Z > 0, there are two bend points 

(V7, g-2ZVZ), i-Vl,q + 2lVl) 
and the graph oi y = f(x) is evidently of one of the three types: 





q-S'ii/r 




Fig. 8 



Fig. 7 



If the equality sign holds in the first or second case, one of the bend 
points is on the x-axis and the cubic 
equation has a double root; the condi- 
tion is that g2 - 4 ^3 = 0^ The third 
case is fully specified by the condition 
(f < 4: P, which implies that I > 0. 
Hence x^ — S Ix + q = has three dis- 
tinct real roots if and only if (f < ^ P, 
a single real root if and only if q^ > 4P; 
and a double root {necessarily real) if and only if q^ = 4: Z'. 

EXERCISES. 

Apply the criterion to find the number of real roots of: 

^1. a;3 + 2x-4 = 0. ^ 2. x^ - 7a; + 7 = 0. ^3. x» - 2x - 1 = 0. 
^4. a:»-3a: + 2 = 0. '^. x^ + Gx^ - 3a: + 1 = 0. 
^. The inflexion point o( y = oc^ — Six + q la (0, q). 

lO.t Trinomial Equations. 

For m and n positive odd integers, m > nj let 

fix) = a:"» + px" + g (j)y£ 0). 
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Here x = is a root of f\x) = only when n > 1 and then the tangent at (0, q) 
is the horizontal inflexion tangent y = g, as shown by (11) with m replaced by n, 
or directly from the fact that zero is a root of odd multiplicity no( z^ + pz*^ = 0. 
Hence in no case is zero the abscissa of a bend point. 

If p > 0, /' has no real root except x = 0. Thus there is no bend point and 
hence a single real root of f{x) = 0. , 

If p < 0, there are just two bend points, their abscissas being b and —6, where 
b is the single positive real root of 6*""" = —rvp/m. The bend points are on the 
same side or opposite sides of the x-axis according as 

/(6)=9 + p6"(l-^), /(_6) = 5 - p6» (l - £) 

are of like signs or opposite signs. The number of real roots is 1 or 3 in the respec- 
tive cases. Hence there are three distinct real roots if and only if the positive 
number 



exceeds both q and — g, i.e., if 



-'*■(' - s 



— p — 6" > -^ 

m m — n 



The first member equals 6"*, so that its (m — n)th power is the mth power of 
l^m-n _ ^fip/m. Hence the conditions are equivalent to 



\ m/ \m — n/ 



EXERCISES t 

l.t x'-+-px-+-9 = has three distinct real roots if and only if 



«>(0"-(I)' 



2.t If p and q are positive, x^^ — px^^ -^ q = has four distinct real roots, 
two pairs of equal roots, or no real root, according as 



(npY* ( nq Y*'^ 
m) \m — n) 



> 0, = 0, or < 0. 

11. Continuity of a Polynomial. Hitherto we have located certain 
points of the graph of t/ = S{x), where J{x) is a polynomial in x with real 
coefficients, and taken the liberty to join them by a continuous curve. 
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The polynomial f{x) in the real variable x shall be called continuous at 
X = a, where a is a real constant, if the difference 

D=f{a + h)- f(a) 

is numerically less than any assigned positive nmnber p for all real values 
of h suJEciently small numerically. 

We shall prove that any polynomial f(x) with real coefficients is con- 
tinuous at X = a, where a is any real constant. 

The proof rests upon Taylor's formula (8), which gives 

Z>. ;.(.,» +g5)».+ ...+j_£^^».. 

Denote by g the greatest numerical value of the coefficients of h, 
h^j . , . f h^. For h numerically less than fc, where ifc < 1, we see that D 
is nmnerically less than 



The same proof shows that, if Oi, . . . , an are real, ai/i + • • • + OnA* 
is numerically less than an assigned positive number p for all real values 
of h sufficiently small numerically. 

12. Theorem. // the coefficients of the polynomial f{x) are real and if 
a and b are real numbers such that f(a) and f(b) have opposite signs, the 
equation f(x) = has at least one real root between a and b; in fact, art odd 
number of such rootSy if an m-fold root is counted as m roots. 

The only argument* given here is one based upon geometrical intui- 
tion. We are stating that, if the points 

(a, /(a)), (bj(b)) 

lie on opposite sides of the x-axis, the graph ot y = f{x) crosses the 
X-axis once, or an odd nmnber of times, between the vertical lines through 
these two points. Indeed, the part of the graph between these verticals 
is a continuous curve having one and only one point on each intermediate 
vertical line, since the function has a single value for each value of x. 
This would not follow for the graph of y^ = x. 

* An arithmetical proof based upon a refined theory of irrational numbers is given 
in Weber's Lehrbuch der AlgebrGf ed. 2, vol. 1, p. 123. 
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13. Sign of a Polynomial. Given a polynomial 

f{x) = oox" + aix"""^ + • • • + a„ (oo ^ Q) 

with real coefficients, we can find a positive number P such that f{x) has 
the same sign as oox" when x > P, In fact, 

/(i)=x-(ao + «), * = ^ + §+--- +i=- 

By the last result in § 11, the nimierical value of <f> is less than that of Oo 
when l/x is positive and less than a sufficiently small positive nmnber, 
say 1/P, and hence when x > P, Then Oo + <f> has the same sign as a©, 
and hence f{x) the same sign as OoX". 

The last result holds also when x is a negative number sufficiently large 
numerically. For, if we set x = —X, the former case shows that/(— X) 
has the same sign as ( — l)'*aoX" when X is a sufficiently large positive 
nmnber. 

We shall therefore say briefly that, for x = +oo, f{x) has the same 
sign as Oo; while, for x = — cao , f(x) has the same sign as Oo if n is even, 
but the sign opposite to Oo if n is odd. 

EXERCISES 

''I. J* -f ax^ -|- 6x — 4 = has a positive real root [use x = and x = +oo ]. 

^2. x^ -^ ax^ -^ bx -]- 4i = has a negative real root [use x = and x = —oo]. 
--3. If Oo > and n is odd, oor" + • • • + On = has a real root of sign opposite 
to the sign of an [use x= — cx), 0, +oo]. 
""4. x^-hox' + fex' + cx — 4 = has a positive and a negative root. 

""5. Any equation of even degree n in which the coefficient of x^ and the con- 
stant term are of opposite signs has a positive and a negative root. 

14. The accuracy of a graph of y = /(x) can often be tested and 
important conclusions drawn from it by use of the 

Theorem. No straight line crosses the graph of y = /(x) in more than 
n points if the degree n of the polynomial /(x) exceeds unity, 

A vertical line x = c crosses it at the single point (c, /(c)). A non- 
vertical line is the graph of an equation t/ = mx + 6 of the first degree, 
and the abscissas of the points of crossing are the roots of mx + 6 = /(x). 
The proof may now be completed by using the next theorem. 
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16. Theorem. An equation of degree n, 

f{x) = oox'* + aix'^-i + • • • + a„ = (oo 5^ 0), 

cannot Aare more than n distinct roots. 

Suppose that it has the distinct roots ai, . . . , a„, a. By the Factor 
Theorem (§ 7), x — ai is a factor of /(x), so that 

fix) = (x - ai) (2(x), 

where Q{z) is a polynomial of degree n — 1. Let x = aa. We see that 
Q(a2) = 0, so that as before 

Q(x) = (x - a2) Qi(x), fix) = (x - ai)(x - a2) Qi(x). 
Proceeding in this manner, we get 

fix) = aoix - ai)(x — a2) . . . (x — an). 

For the root a, the left member is zero and the right is not zero. Hence 
our supposition is false and the theorem true. 

EXERCISES 

^ 1. The curve in Pig. 3, representing a cubic function^ does not cross the x-axis 
at a second point further to the right, nor does the part starting from M' and 
running downwards to the left later ascend and cross the x-axis. 

2. The curve in Fig. 2, representing a quartic function, has only the four cross- 
ings shown. 

-3. Form the cubic equation having the roots 0, 1, 2. 
^4. Form the quartic equation having the roots ±1, ±2. 

5. If oox'* + • • • = has more than n distinct roots, each coefficient is zero. 
When would the theorem in § 14 fail if n = 1? 

^ 6. If two polynomials in x of degree n are equal for more than n distinct values 
of X, they are identical. 

N 7. An equation of degree n cannot have more than n roots, a root of multiplicity 
m being counted as m roots. 

16. Graphical Solution of a Quadratic Equation. If 

(12) x2 - ox + 6 = 

has real coefficients and real roots, the roots may be constructed by the 
use of ruler and compasses, i.e., by elementary geometry. 




16 THEORY OF EQUATIONS ICh. I 

Draw a circle having as a diameter the line BQ joining the points 
B = (0, l)and Q = (a, b); the abscissas ON and OM of the points of 

intersection of this circle with the x-axis are 
the roots of (12). 

The center of the circle is (a/2, (6 + l)/2). 
The square of BQ is a^ + (6—1)^. Hence the 
equation of the circle is 

('-i)+(»-^)"=f+(^)" 

Setting 2/ = 0, we get (12). 
Fig. 9 If we do not insist upon a solution by 

ruler and compasses, we may plot the par- 
abola y = x^ and draw the straight line y = oa: — 6; if these intersect, 
the abscissas of the points of intersection are the real roots of (12). 

17. The method last used enables us to solve graphically 

x' — ox + 6 = 0. 

We have merely to employ the abscissas of the intersections of the graph 
(Fig. 4) of y = x^ with y = ax — b. For the quartic equation 

z* + Az^ + Bz + C = 0, 
set2 = xVI; weget ^ + ^2_ax + b = 0. 

We now employ the graphs of y = x* + x^, y = ax — b. 

EXERCISES 

Solve by each of the two methods 

1. x^-5x-f-4 = 0. 2. x2 + 5x + 4 = 0. 3. x« + 5x-4 = 0. 

4. x* - 5x - 4 = 0. 5. x2 - 4x + 4 = 0. 6. x* - 3x + 4 = 0. 

Solve graphically the cubic equations 

7. x»-3x + l = 0. 8. x» + 2x-4 = 0. 9. x» - 7x -f 7 = 0. 

10. Find ^aphicaUy the cube roots of 20, -20, 200. 

11. State in the language of elementary geometry the construction of Fig. 9 
and prove that OC = TQ = b,TD = OB = I, chord BN = chord DM, ON = MT, 
ON -f OM =^a,ON'OM = OC'OB = b. Why are OM and ON the roots of (12)? 

12. Any reduced cubic equation a^ = px -{- q can be solved by use of a fixed 
parabola x* = y and the circle x^ -f !/* = ^x -f- (p -f- l)y, (Descartes.) 

13. X* = px^ + qx + r can be solved by use of a fixed parabola x* = y and the 
circle x* -f 2/^ = ^x -f (p -f 1)2/ -f r. (Descartes.) 

14. Solve the cubics in Exs. 7-9 by the method of Ex. 12. 

15. Solve X* = 25 x2 - 60 X + 36 by the method of Ex. 13. 
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18.t The approximate values of the real roots of a cubic equation 

^ + pz + q = 

may be found by a graphical method due to C. Runge.* We assign 
equidistant values to z. For each z, we have a linear equation in p and q 
which therefore represents a straight line when p and q are taken as rec- 
tangular coordinates. On a diagram showing these lines we may locate 
approximately the line (and hence the values of z) corresponding to 
assigned values of p and q. The method applies also to any equation 
involving two parameters linearly. 

For the solution of a numerical cubic equation by means of the slide 
rule (and an account of the use of the latter), see pp. 43-48 of the book 
jast cited. 

* Graphical Methods^ Columbia University Press, 1912, p. 59 (also, Praxis der 
Gleichungerif Leipzig, 1900, p. 156). 



CHAPTER II 



Complex Numbers 

(For a briefer course, this chapter may be begun with § 5.) 

1. 1 Vectors from a Fixed Origin 0. A directed segment of a straight 
line is called a vector. We shall employ only vectors from a fixed initial 
point 0, 
The sum of two vectors OA and OC is defined to be the vector OS, 

where S is the fourth vertex of the par- 
'^ allelogram having the lines OA and OC 

as two sides. In case A coincides with 0, 
the vector OA is said to be zero; then 
OS = OC. 




A force of given magnitude and given dir- 
ection is conveniently represented by a vector. 
By a fundamental principle of mechanics, two 
forces, represented by the vectors OA and OC, 
Fig. 10 have as their resultant a force represented by 

the vector OSj as in Fig. 10. Thus if two forces 
are represented by two vectors, their resultant is represented by the sum of 
the vectors. 

When referred to rectangular axes OX and OY, let the point A have the 
coordinates OE = a, EA = b, and the point C the coordinates OF = c, 
FC = d. Draw AG parallel to OX and SGH perpendicular to OX. Since 
triangles OFC and AGS are equal, AG = c, GS = d. Hence the coor- 
dinates of the point S are OH = a + c and HS = b + d. The sum of 
the vecUyrs from to the points (a, b) and (c, d) is the vector from to the 
point (a + c, 6 + d), whose coordinates are the sums of the corresponding 
coordinates of the two points. 

Subtraction of vectors is defined as the operation inverse to addition of 
vectors. If OA and OS are given vectors, the vector OC for which OA 
+ OC = OS is denoted by OS — OA, and is determined by the side OC 
of the parallelogram with the diagonal OS and side OA. 

18 
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2.t Multiplication of Vectors. Let A be a point {r, ^} with the polar 
coordinates r, 6. Then r is the positive number giving the length of the 
line OA, while 6 is the measure of the angle XOA when measured counter- 
clockwise from OX, as in Trigonometry. Let C be the point { r', 6' \ with 
the polar coordinates r', ^'. 

The product OA • OC of the vectors from to A = \r, 6\ and to C = 
Jr', ^'} is defined to be the vector from to P = \rr\e + e'\. 




Fig. 11 




To construct this product geometrically, let U be the point on the 
a:-axis one imit to the right of 0. Let the triangle OCP be constructed 
similar to triangle OUA, such that corresponding sides are OC and OC/, 
CP and UA, OP and OA, and such that the vertices 0, C, P are in the 
same order (clockwise or counter-clockwise) as the corresponding vertices 
0, U, A. Then OP : r' = r : 1, so that the length of OP is rr\ The 
angle XOP, measured counter-clockwise from OX, equals d + 0', and may 
exceed four right angles. Hence the product of the vectors OA and OC 
is the vector OP. 

If OC = OU, then OP = 0.4, and 0C7 • 0.4 = OA. Hence vector OU 
plays the r61e of unity in the multiplication of vectors. 

Division of vectors is defined as the operation inverse to multiplication 
of vectors. If OA and OP are given vectors, the vector OC for which 
OA'OC = OP is denoted by OP/OA. li A = \r, 6] and P = {n, ^i} then 
C = {ri/r, ^1 — ^}. Division except by zero is therefore always possible 
and unique. 

EXERCISES t 

l.t Vector addition is associative: {OA + OC) + OL = OA + (OC + OL). 
2.t V^tor multiplication is associative: {OA • OC) •OL^OA* {OC • OL). 
3.t Draw the figure corresponding to Fig. 12, when OA is in the third quadrant 
and OC in the first quadrant. 
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3.t Symbol for Vectors from 0. We consider only vectors starting 
from the fixed point 0. Such a vector OA is uniquely determined by its 
terminal point A = (a, b) and hence by the Cartesian coordinates a, b of 
the point A referred to fixed rectangular axes OX and OY. We may 
therefore denote the vector OA by the symbol [a, 6]. Then 

(1) [a, b] = [c, d] if and only if a = c, b = d. 

By the definition of addition and subtraction of vectors (§1), 

(2) [a, b] + [c, d] = [a + c, 6 + d], 

(3) [a, 6] - [c, d] = [a - c, 6 - d]. 

As our definition of the product of two vectors was made in terms of 
polar coordinates, we must now express the product in terms of Cartesian 
coordinates. By Fig. 11, we have 

a = r cos 6, b = rsinO. 

Similarly, if the point (c, d) has the polar coordinates r', d\ 

c = r' cos S\ d = r' sin 6\ 

Hence the definition (§2) of the product of two vectors gives 

[a, b] [c, d] = [rr' cos {6 + 6'), rr' sin {6 + d')l 

the final numbers being the Cartesian coordinates of the point with the 
polar coordinates rr' and 6 + S\ But 

rr' cos {e + d') = rr' (cos B cos ^' - sin B sin B') = ac — bd, 
rr' sm {B + B') = rr' (sin B cos B' + cos B sin B') =bc + ad. 

Hence, finally, 

(4) [a, b] [c, d] = [ac — bd, ad + be]. 

Given a, 6, e, /, we can find solutions c, d of the equations 

ac — bd = e, ad + be = f, 

provided a^ + b* 5^ 0, viz., a and b are not both zero. Then 

[a, 6] [c, d] = [e, /] 

determines [c, d], its expression being 

[e,/] ^ [ qg + y g/ - be l 
^^^ [a, 6] La' + ^'' a^ + b^y 

Hence division, except by the zero vector [0, 0], is always possible and 
unique. 
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4.t Introduction of Complex Numbers. Giving up the concrete in- 
terpretation in § 3 of the symbol [x, y] as the vector from the origin to 
the point (x, y), we shall now think abstractly of a system of elements 
[Xy y] each determined by two real numbers x, y, and such that the sys- 
tem contains an element corresponding to any pair of real numbers. 
While the present abstract discussion is logically independent of the 
earUer exposition of vectors, yet we shall be guided in our present choice 
of definitions of addition, multiplication, etc., of our abstract symbols 
[^} y] by the desire that the vector system shall furnish us a concrete 
representation of the present abstract system. Accordingly, we define 
equality, addition, subtraction, multiplication and division of two ab- 
stract elements [x, y] by formulas (l)-(5). In particular, we have 

[a, 0] zb [c, 0] = [a =b c, 0], 

K 0] [c, 0] = [ac, 0], ]^ = [|.o], 

provided a 5«^ in the last relation. Hence the elements [x, 0] combine 
under our addition, multiplication, etc., exactly as the real numbers x 
combine under ordinary addition, multiplication, etc. We shall there- 
fore introduce no contradiction if we now impose upon our abstract 
system of elements [x, t/], subject to relations (l)-(5), the further condi- 
tion that the element [x, 0] shall be the real number x. Then, by (4), 

[0,1] [0,1] = [-1,0]= -1. 

We write i for [0, 1]. Hence i^ = —1. Then 

[x, y] = [x, 0] + [0, 2/] = X + [2/, 0] [0, 1] = X + yi. 

The resulting symbol x + yi is called a complex number. For y = 0, it 
reduces to the real number x. For t/ ^^ 0, it is also called an imxiginary 
number. The latter is not to be thought of as unreal in the sense that 
its use is illogical. On the contrary, x + yi is a convenient analytic rep- 
resentation of the vector from the origin to the point (x, y), and the sum, 
product, etc., defined above, of two such complex numbers then repre- 
sent those simple combinations of the two corresponding vectors (§§ 1, 2) 
which are constantly used in the applications of vectors in mechanics and 
physics. Since these vectors from are uniquely determined by their termi- 
nal points, we obtain a representation (§8) of complex numbers by points 
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in a plane, a representation of great importance in mathematics and its 
applications. 

If in (l)-(5), we replace the symbol [a, 6] by a + 6i, etc., we obtain the 
formulas given in § 5. 

5. Formal Algebraic Definition of Complex Numbers. The equa- 
tion x^ = — 4 h as n o real root, but is said to have the two imaginary roots 
V^ and — V— 4. We shall denote these roots by 2 i and —2 1, agree- 
ing that i is a definite number for which v^ =—1. Similarly, we shall 
write VS i in preference to V— 3. If p is positive, Vp is used to denote 
the positive square root of p. 

If a and b are any two real numbers, a + bi is called a complex number 
and a — bi its conjugate. Two complex numbers a + bi and c. + di are 
called equal if and only if a = c, 6 = d. Thus a + 6i = if and only if 
a = 6 = 0. 

Addition of complex numbers is defined by 

(a + bi) + (c + di) = (a + c) +{b + d)i. 

The inverse operation, called subtraction, consists in finding a complex 
number z such that (c + di) -{• z = a + bi. In notation and value, z is 

[fl + bi) — ic + di) = (a — c) + (6 — d)L 

Multiplication is defined by 

(a + bi)(c + di) = {ac — bd) + (ad + bc)i, 

and hence is performed as in formal algebra with a subsequent reduction 
by use of i^ = — 1. If we replace 6 by —6 and d by — d, the right member 
is replaced by its conjugate. Hence the product of the conjugates of two 
complex members equals the conjugate of their product. 

Division is defined as the operation inverse to multiplication, and con- 
sists in finding a complex number q such that (a + bi)q = e + fi. Mul- 
tiplying each member by a — 6i, we find that q is, in notation and value, 

e +fi __ (e +fi){a — 60 _ ae -\-bf af — be . 
M^' a^ + b" a2 + 62 ■+• ^^+52^- 

Since a^ + 6^ = implies a = b = when a and 6 are real, division except 
by zero is possible and unique. 
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6. The Cube Roots of Unity. The roots of x* = 1 are unity and the 
numbers for which 

X — 1 
Hence the three cube roots of unity are 1 and 

EXERCISES 

^ 1. Verify that «' = w^, coo,' = 1, w^ + « + 1 = 0, «' = 1. 

^ 2. The sum and product of two conjugate complex numbers are real. 

^3. Express as complex numbers 

3 + 5i a + bi 3 + V^^ 
2~3i' a-bi' 2 + ^/Zr[' 

^4. If x, y, 2 are any complex-numbers, . A -/i m • 7 * 

xy = yx/ (xy)z = x(yz)y iiy^^z^^^^^^^xz^ . -^-a-ma* fq^n^/ZiMt^ 

What is the name of the property indicated by each equation? •^ 

5. If the product of two complex numbers is zero, one of them is zero. 
C.t Deduce the laws in § 5 from those inj^. 

7. Square Roots of a + hi found Algebraically. Given the real num- 
bers a and b, b j^ 0, we seek real numbers x and y such that 

a + hi = {x + yiy = x^ — y^ + 2 xyi. 
Thus 

a? — y^ = Uf 2 xy = 6, 

(3? + 2/2)2 = (3.2 _ y2y + ^xY = a' + 6^. 
Since x and y are to be real and hence x^ + y^ positive, 

x^ + y^ = Va^ + 62, 

the positive square root being the one taken. Combining this equation 
with X* — t/2 = a, we get 

, Va^ + b^ + a , V^~+^-a 
^ = n ' y = n 
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Since these expressions are positive, real values of x and y may be found. 
The two pairs x, y for which 2xy = b give the desired two complex num- 
bers X + yi. 

It is not possible to find the cube roots of a general complex number by 
a similar algebraic process (Ch. Ill, § 6). 

EXERCISES 

Express as complex numbers the square roots of 

^1. -7 + 24 1. "2. -ll + 60i. ^3. 5-12i. 

-4. 4c(i-f-(2c2-2d2)i. -5. c» - <? - 2 V - c^tf^. 

8. Geometrical Representation of Complex Numbers. Using rec- 
tangular axes of coordinates, we rep resent* a + 6i by the point A = (a, 6). 
The positive number r = Va^ + b^ giving the length of OA is called the 
modulus (or absolute value) of a + bi (Fig. 11). The angle 6 = XOA, 
measured counter-clockwise from OX, is called the amplitude (or argument) 
of a + bi. Thus 

(6) a + bi = r(cos 6 + i sin 6). 

The second member is called the trigorurmetric form of a+ bi, 

■ 

If c + di is represented by the point C, then the sum of a + bi and 
c + di is the complex number represented by the point S (Fig. 10) 
determined by the parallelogram OASC. Since OS = OA + AS, the 
modulus of the sum of two complex numbers is equal to or less than the sum 
of their moduli. 

For example, the cube roots of unity are 1 and 

= cos 120° + i sin 120°, 

co2= -^-iV3i 

= cos 240° + i sin 2^0°, 
Fig. 13 

and are respresented by the points marked 1, w, «^ in Fig. 13. They form 

* It will be obvious to the reader who has not omitted §§ 1-4 that the present rep- 
resentation is essentially equivalent to the representation of a + &i by the vector from 
O to the point (a, 6). 
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the vertices of an equilateral triangle inscribed in a circle of unit radius 
and center at the origin 0. 

9. The product of the complex number (6) by r'(cos a + i sin a) is 

rr' [cos {d + a)+ i sin (6 + a)], 
since 

(7) (cos ^ + i sin ^)(cos a + i sin a) = cos (0 + a) + i sin (^ + a). 

The latter follows from 

cos cos a — sin ^ sin a = cos {B + a), 
cos ^ sin a + sin ^ cos a = sin (^ + a). 

Hence the modvlus of the 'product of two complex numbers equals the product 
of their moduli^ and the amplitude of the product equals the sum of their 
amplitudes. 

The product may be found geometrically as in Fig. 12. 

For the special case a = ^, (7) becomes 

(cos ^ + I sin ^)2 = cos2^ + isui2^. 

This is the case n = 2 of formula (8). In particular, we see why the 
amplitude of w^ is 240° when that of w is 120° (end of § 8). 

10. De Moivre's Theorem. // n is any positive integer ^ 

(8) (cos ^ + i sin BY = cos n^ + i sin vB, 

This relation is an identity if n = 1 and was seen to hold if n == 2. 
To proceed by mathematical induction, let it be true if n = m. Using 
(7) for a = mBy we then have 

(cos ^ + i sin 0)*"+^ = (cos ^ + i sin 6) (cos -{-i sin B^ 
= (cos ^ + i sin 0)(cos mB + i?m mB) = cos (m + i)B + i sin (m + 1)B. 

Hence (8) is true also if n = m + 1. The induction is thus complete. 

Since cos e + i sin e represents the vector from the origin to the point { 1, dj , 
given in polar coordinates, its nth power represents (§2) the vector from to the 
point U, nd} and hence is cos nd + i sin nd. 
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11. Cube Roots. To find the cube roots of a complex number, we 
first express it in the trigonometric form (6). For example, 

4 v^ + 4 V2i = 8 (cos45° + isin45°). 

If it has a cube root of the form (6), then, by (8), 

r' (cos3^ + ism3^) = 8 (cos45° + isin45°). 

Their moduli r* and 8 must be equal, so that the positive real number r 
equals 2. Since 3 6 and 45° have equal cosines and equal sines, they differ 
by an integral multiple of 360°. Thus 

^ = 15° + A; . 120° {k an integer). 

Since in (6) we may replace ^ by ^ + 360° without changing a + 6i, we ob- 
tain just three distinct cube roots (given by fc = 0, 1, 2): 

2 (cos 15° + 1 sm 15°), 2 (cos 135° + i sm 135°), 2 (cos 255° + i sm 255°). 

EXERCISES 

^1. Verify that the last two numbers equal the products of the first number by 

w and o)', given at the end of § 8. 

N 2. Find the three cube roots of —27; those of — i. 

"^ 3. Find the three cube roots of — i + i Vs L 

12. nth Roots. Let p be a positive real number. As illustrated in 
§ 11, it is evident that the nth roots of p (cos A + i sin A) are the prod- 
ucts of the nth roots of cos A + i sin A by the positive real nth root of 
p. Let an nth root of cos A + i sin A be of the form (6). Then, by (8), 

r'*(cos nd + I sin rud) = cos A + i sin A. 

Thusr* = 1, r = 1, and ?i^ = A + fc • 360°, where fc is an integer. Thus 
n distinct nth roots of cos A + i sin A are given by 

,., A +A:.360° , . . A + fc.360° ,, ^ , .. 

(9) cos htsm (A; = 0, 1, . . . , n — 1), 

n n 

whereas k = n gives the same root as fc = 0, and fc = n + 1 the same 
root as fc = 1, etc. Hence any number 9^ has exactly n distinct nth com- 
plex roots. 

EXERCISES 

L Find the five fifth roots of — 1. 

2. Find the nine ninth roots of 1. Which are roots of x* = 1? 

3. Simplify the trigonometric forms of the four fourth roots of unity. Check 
the result by factoring x* — 1. 
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COMPLEX NUMBERS 



27 



13. Roots of Unity. By (9) the n distinct nth roots of unity are 
(10) 



2 ACT , . . 2 n^T /I f\ t IN 

COS + tsm— (« = 0, 1, . . . , n— 1), 



n 



n 



where now the angles are measured in radians (an ang'e of 180 degrees 
equals v radians, where t = 3.1416, approximately). For fc = 0, (10) 
reduces to 1, which is an evident nth root of unity. For fc = 1, (10) is 



(11) 



27r , . . 27r 

r = cos h t sm — 

n n 



By DeMoivre's Theorem (§10), the general number (10) equals the fcth 
power of r. Hence the n distinct nth roots of unity are 



(12) 



r, r^, 7^, . . . , r"~S r" = 1. 



The n complex numbers (10), and therefore the numbers (12), are rep- 
resented geometrically by the vertices of a regular polygon of n sides 
inscribed in the circle of radius unity and center, at the origin with one 
vertex on the x-axis (Fig. 14). 




n-:l 




—x 



Fig. 14 



Fig. 15 



For n = 3, the numbers (12) are «, w^, 1, shown in Fig. 13. 

For n = 4, we have r = cos t/2 + i sin 7r/2 = i. The fourth roots of 
unity (12) are i, i^ = —1,1^= — i, i* = 1. These are represented by the 
vertices of a square inscribed in a circle of radius unity (Fig. 15). 

EXERCISES 

^ 1. For n = 6, r = — w*. The sixth roots of unity are therefore the three cube 
roots of unity and their negatives. Check by factoring x* — 1. 

2. From the point representing a + hi how do you obtain that representing 
— (a + 6i)? Hence derive from Fig. 13 and Ex. 1 the points representing the six 
sixth roots of unity. 

3. Which powers of a ninth root (11) of unity are cube roots of unity? 
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14. Primitive nth Roots of Unity. An nth root of unity is called 
primitive if no power of it, with a positive integral exponent less than n, 
equals unity. Since only the last one of the numbers (12) equals unity, 
the number r, given by (11), is a primitive nth root of unity. 

For n = 4, both i and — i are primitive fourth roots of unity, while 
1 and —1 are not. Just as i^ = —1 and i* = +1 are not primitive fourth 
roots of unity, so r* is not a primitive nth root of unity if k and n have a 
common divisor d (d > 1). Indeed, 

n k^ 

(r*)d = (r")d= 1, 

whereas n/d is a positive integer less than n. But if k and n are relatively 
prime, i.e., have no common divisor exceeding unity, r* is a primitive nth 
root of unity. To prove this, we must show that (r*)' 9^ 1 \i I is a. posi- 
tive integer less than n. Now, by De Moivre's Theorem, 

., 2klT , . . 2klw 

r** = cos h I sm • 

n n 

If this were unity, 2 klir/n would be a multiple of 2 t, and hence kl a 
multiple of n. Since A* is relatively prime to n, the second factor I would 
be a multiple of n, whereas < Z < n. Hence the primitive nth roots of 
unity are those of the numbers (12) whose exponents are relatively prime to n. 

EXERCISES 

1. The primitive cube roots of unity are w and w'. 

2. For r given by (11), the primitive nth roots of unity are (i) for n = 6, r, r*; 
(ii) for n = 12, r, r«, r\ r". 

3. For n a prime, any nth root of unity, other than 1, is primitive. 

4. If r is a primitive loth root of unity, r', r*, r', r" are the primitive 5th roots 
of unity, and r*, r*° are the primitive cube roots of unity. Show that their 8 prod- 
ucts by pairs give all of the primitive loth roots of unity. 

5. If n is the product of two primes p and g, there are exactly (p — 1)(7 — 1) 
primitive nth roots of unity. 

6. If p is any primitive nth root of unity, p, p', p^ . . . , p** are distinct and give 
all of tbe nth roots of unity. Of these, p* is a primitive nth root of unity if and 
only if k is relatively prime to n. 

16. Imaginary Roots Occur in Pairs. The roots oix* + 2cx + d = 
are 



(13) "C+Vi^-d, -c-Vc'-d. 
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If c and d are real, these roots are both real or are conjugate imaginaries. 
The latter ease illustrates the following 

Theorem. // a and b are real numbers, b 7^ 0, and if a + bi is a root 
of an equation with real coeffidentSy then a — bi is a root. 
Let the equation be f(x) = 0. Divide f{x) by 

(14) {x-ay + b^=(x-a- bi){x - a + U)- 

until we reach a remainder rx + s of degree less than the degree of the 

divisor in x. Evidently r and s are real. If the quotient is Q(x), we 

have 

fix) = Q(x) \{x-aY + V\+rx + s, 

identically in x (Ex. 6, p. 15). Let x = a + bi. Since this is a root of 
fix) = 0, we see that 

= r(a + W) + s, = ra + s, = rb. 

Since 6 5*^ 0, we have r = and then s = 0. Thus/(x) has the factor (14), 
so that fix) = has the root a — bi, 

16. t Generalization of the theorem in §15. The sum of the roots 

(13) o{x^ + 2cx-{-d = equals the negative of the coefficient 2 c of x, 

and their product equals the constant term d. It follows that 2 + i and 

—2 are the roots of 

z2-iz-4-2i = 0, 

and that 2 — i and —2 are the roots of 

z' + iz-^ + 2i = 0. 

We have here an illustration of the following 

Theorem. // a and b are real numbers and ifa + bi is a root offiz) = 0, 
then a — bi is a root of giz) = 0, where giz) is obtained from the polynomial 
fiz) by replacing each coefficient c + di by its conjv^gate c — di. 

Consider any term (c + di)s^ of fiz). Replace z hy x + yi, where x 
and y are real. The term 

ic + di)ix + yi)^ 

oifix + yi) has as its conjugate imaginary the product 

(c — di) (x — yi)^ 
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of the coujugates of the factors of that term (§5). But the new product 
is a term of g(x — yi). Hence the latter is the conjugate A — Bi of 
f{x + yi) = A + Bi, where A and B are polynomials in x and y with 
real coefficients. 
Take x = a, y=b. Then A=B=Ohy hypothesis. Hence g{a—bi) = 0. 

EXERCISES 

l.f The theorem in § 15 is a corollary to that in § 16. 

- 2. Solve a:»-3x»-6a:-20 = 0, with the root -1 + V^, 

- 3. Solve x*-4a:» + 5x*-2a;-2 = 0, with the root 1 - i. 

-^ 4. Find the cubic equation with real coefficieats two of whose roots are 1 and 
3 + 2 i. 

5.t Given that x* + (1 — i)x'^ -f- 1 = has the root i, find a cubic equation 
with the root — t. Form an equation with real coefficients whose roots include 
the roots of these two cubic equations. 

^6. If an equation with rational coefficients has a root a + Vd, where a and b are 

rational, but y/b is irrational, it has the root a — y/b. [Use the method of § 15.] 

^7. Solve x^-^x^ + Ax-l^O, with the root 2 + Vs. 

•^8. Solve x* - (4 + >/3)x« + (5 + 4 V3)x - 5 Vi = 0, with the root Vz. 
" 9. Solve the equation in Ex. 8, given that it has the root 2 + i. 

^ 10. What cubic equation with rational coefficients has the roots J, J + V2 ? 



CHAPTER III 
Algebraic and Trigonometbic Solution of Cubic Equations 

1. Reduced Cubic Equation. If in the general cubic equation 

(1) a:» + 6x2 + ex + d = 0, 

we set X = J/ — 6/3, we obtain a reduced cubic equation 

(2) y' + py + q^O, 

where 

/ON ^ ^ 6c , 26» 

(3) P = c-3, g = d-- + _. 

A geometrical interpretation of this process was given in Ex. 5, p. 10. 
We shall find the roots 2/1, 2/2, Vz of (2). Then the roots of (1) are 

/A\ 6 6 6 

(4) Xi = 2/1 - g> X2 = 2/2 - g> aJs = 2/8 - g- 

2. Algebraic Solution of Cubic Equation (2). We shall employ a 
method essentially that given by Vieta * in 1591. We make the substi- 
tution 



P 
« in (2) and obtain 



(5) y^z-£ 



a»-J^ + ff = 0. 



27 
Multiplying each member by 2*, we get 

(6) 2« + 32»-g = 0. 

Solving this as a quadratic equation for z*, we obt^n 

* Opera Math., IV, published by A. Anderson, Paris, 1615. 

31 
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By Ch. II, § 11, any number has three cube roots, two of which are the 
products of the remaining one by 

(8) co=-i+^V3i, ««=-^-^V3i. 

Since 

(-l+^X-l-v^) -(-!)■ 

we can choose particular cube roots 



(9) 



a = \/-^ + Vr, 5 = y/-|-Vfi, 



such that AB = — p/3. .Then the six values of z are 

A, (aA, (o^A, By cojB, uy^B. 

These can be paired so that the product of the two in each pair is — p/3: 
AB = -p/3, u)A . oy^B = -p/3, o)^A • wS = -p/3. 

Hence with any root z is paired a root equal to — p/(3 z). By (5), the 
sum of the two is a value of y. Thus the three roots of (2) are 

(10) 2/1 = A + B, 1/2 = coA + CO^B, 1/3 = coU + coS. 

These are kno^^^l as Cardan's formulas for the roots of a reduced cubic 
equation (2). The expression A + B ior a root was first published by 
Cardan in his Ars Magna, 1545, although he had obtained it from Tartaglia 
under promise of secrecy. 

EXERCISES 

1. For y* - 15?/ - 126 = 0, y = z + 5/z and 

«• - 1262^ + 125 = 0, 2» = 1 or 125, z = 1, w, «', 5, 5 «, 5 «*. 

The first three zs give the distinct y's: 6, w + 5 w', w' + 5 w. 

2. SoWer/- ISy + 35 = 0. 3. Solve j' + 6 j' + 3x + 18 = 0. 
4. Solve 1/ - 2 7/ + 4 = 0. ■ .'>. Solve 28x» + Ox* - 1 = 0. 

6. Using w^ + w + 1 = 0, show from U^) that 

!/i + 2/2 + !/3 = 0, ?/i7/2 + 2/11/3 + 2/22/8 = p, 2/12/4/1 = — 5- 

7. By (3), (4) and Ex. 6, show that, for the roots of (1), 

Xi + J2 + xi = -6, xixi + XiXi + x»rj = c, XiXjXj - —A 
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3. Discriminant. By (10) and co^ = 1, 

2/1 - 2/2 = (1 - w)(A - (o^B), 

2/1 - 2/3 = -w2 (1 - U)){A - wS), 

2/2 - 2/8 = w (1 - «)(-4 - ^)- 
To form the product of these, note that w^ = 1 and, by (8), 

(1 - co)3 = 3 (co2 - co)'= -3 V3i. 

Since the cube roots of unity are 1, w, co^, we have 

X' — 1 = (X — 1)(X — W)(X — 0)2), 

identically in x. Taking x = A/B, we see that 

(11) A3 - B3 = (A - B){A - coB)(A - a)2S). 
The left member equals 2 VR by (9). Hence 

(12) (2/1 - 2/2) (2/1 - 2/3) (2/2 - 2/3) = 6 V3 Vfi i. 

The product of the squares of the differences of the roots of any equation 
in which the coeflScient of the highest power of the variable is unity shall be 
called the discriminant of the equation. Thus the discriminant is zero if 
and only if two roots are equal, and is positive if all the roots are real. 

In view of (12) the discriminant A of the reduced cubic equation (2) 
has the value 

(13) A = -108i2= -4p3-27g2. 

By (4), Xi — X2 = 2/1 — 2/2, etc. Hence the discriminant of the general 
cubic (1) equals the discriminant of the corresponding reduced cubic (2). 
By (3) and (13), 

(14) A = 186cd - 463d + 6V - ^<^^27(P. 

It is sometimes convenient to employ a cubic equation 

(15) ax^ + bx^ + cx + d = 0, 

in which the coefficient of x^ has not been made unity by division. The 
product P of the squares of the differences of its roots is evidently derived 
from (14) by replacing 6, c, d by 6/a, c/ay d/a. Thus 

(16) a'P = ISabcd - ^b^d -f 6V - 4ac' - 27 a^cP. 

This expression (and not P itself) is called the discnminant * of (15). 

* Some writers define — ^V ^^^ to be the discriminant of (15) and hence — ^ A as 
that of (1). On this point see Ch. IV, § 4. 
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4. Theorem. A cubic equation with real coefficients has three distinct 
real roots, a single real root, or at least two equal real roots, according as its 
discriminant is positive, negative or zero. 

It suffices to prove the theorem for a reduced cubic equation (2) in 
which p and q are real. First, let A ^ 0. By (13), 72 ^ 0. Using (8), 
we find that the roots (10) are 

(17) A+B, -iU+B)±i(A-B)V3i. 

But A and B, in (9), may now be taken to be real, since R = 0, 
li R > 0, A 7^ B and A + B is the only real root. If /2 = 0, then 
A ^ B and the roots are real and at least two are equal. 

Next, let A > 0, so that R <Q. Since —\q+ VS is an imaginary num- 
ber it has (Ch. II, § 11) a cube root of the form A = a + jSi, where a and 
/8 are real and fi 7^ 0, Then (Ch. II, § 16) S = a — fii is a cube root of 

— J g — y/R. For these cube roots, the product AB is real and hence 
equals — p/S, as required in § 2. Hence 

yi = 2a, 2/2=-«-i3V3, y, = -a + /3V3. 
These real roots are distinct since A 7^ 0. 



EXERCISES 

Find by means of A the number of real roots of 

1. y»- 152/ + 4 = 0. 2. 2/»-272/ + 54 = 0. 3. x» + 4a:* - llx + 6 = 0. 
' 4. Using A = (xi — 0:2)* {xi — xzY (x2 — Xj)', show that, if Xi and xj are con- 
jugate imaginaries and hence Xi real, A < 0; if the x's are all real and distinct, 
A > 0. Deduce the theorem of § 4. 

5. Deduce the same theorem from Ch. I, § 9. 

6. Irreducible Case. When the roots of a cubic equation are all real 
and distinct, R is negative (§ 4), so that Cardan's formula present their 
values in a form involving cube roots of imaginaries. This is called the 
irreducible case.* We shall derive modified formulae suitable for numer- 
ical work. Since any complex numl)er can be expressed in the trigono- 
metric form, we can find r and such that 

(18) -Iq+VR = r{cose + isinS), 

* This term is not to be confused with "irreducible equation." 
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In fact, the conditions for this equality are 

— i g = r cos ^, R = — r* sin* $. 
Hence _ . 

r^^r" (cos*^ + sin^d) = ig» - /e = -^, 



(19) ^=V/^' ^*^--2^-^v/^- 

Since /E is negative, p is negative and r is real. Since R < 0, the value 

(19) of cos d is numerically less than unity. Hence 6 can be found from 
a table of cosines. 

The complex number conjugate to (18) is 

(20) -ig - v^ = r(cos^ - isin^). 
The cube roots of (18) and (20) are 

. A^r g + m>360^ . . . g + m > 360^ , ^ i ox 

y-y- cos g rtism g (m = 0, 1, 2). 

For a fixed value of m the product of these two numbers is — p/3. Hence 
their sum is a root of our cubic equation. Thus if R is negative, the three 
distinct real roots are 

(21) 2>J=fcosi±^!^ (»» = 0,1,2). 

EXERCISES 

1 . Solve the cubics in Exs. 1, 2, page 34. 

2. Solve y» - 2y - 1 = 0. 3. Solved - 7y + 7 = 0. 
^. 4. Find constants r and s such that 

y^ + py + q ^ \r {y + sy-s(y + r)»} 

r — 8 

identically in y. Hence solve the reduced cubic equation. 

6.t Algebraic Discussion of the Irreducible Case. Avoiding the use 
of trigonometric functions, we shall attempt to find algebraically an 
exact cube root z + yi of a + hi, where a and h are given real numbers, 
5^0. We desire real numbers x and y such that 

(x + yiy = a + 6i, 
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whence a? — 3xy^ = a, Soi^y — y^ ^b. 

Thus y 7^ and we may therefore set x = sy. Hence 

(83-35)2/3 = 0, (3s2- 1)2/3=6 

Eliminating y^, we get 

^-^^-'38 + 1 = 0. 





Set 8 = t + a/b. We obtain the reduced cubic equation 

e'-3kt-2zk==0 





The R of (7) is here — k^. Thus Cardan's formulse for the roots t involve 

\\Tiile the first factor is the cube root of a real number, the second is 
exactly the cube root which we started out to find. 

Hence this algebraic process in conjunction with that in § 2 fails to give 
us the real roots of our cubic equation. Conceivably other algebraic proc- 
esses would succeed; but it can be proved * rigorously that a cubic equation 
with rational coefficients having no rational root, but having three real 
roots, cannot be solved in terms of real radicals only. Hence there does 
not exist an algebraic process for finding the real values of the roots in 
the irreducible case. 

A cube root of a general complex number cannot be expressed in the 
form X + yiy where x and y involve only real radicals. For, if so, Cardan's 
formulae could be simplified so as to express the roots of any cubic equa- 
tion in terms of real radicals only. 

7. t Trigonometric Solution of a Cubic Equation with A > 0. In the 

irreducible case we may avoid Cardan's formula* and the simplifications 
in § 5. The same final result.s are now obtvained by a direct solution based 
upon the well-known trigonometric identity 

cos 3 X = 4 cos* X — 3 cos x. 

* H. Weber and J. Wellstcin, Encyklopddie dcr Elementar-McUhenuUik, I, ed. 1, p. 325; 
ed. 2, p. 373; ed. 3, p. 364. 
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This may be written ia the form 

2* — f z — J cos 3 X = (z = coax). 

To transform cubic (2) into this one, set y = rw. Thus 
The two cubic equations are identical if 



Since iZ < 0, p < and the value of cos 3 x is real and numerically < 1. 

Hence we can find 3 x from a table of cosines. The three values of z are 

then 

cos X, cos {x + 120°) , cos {x + 240°) . 

Multiplying these by n, we get the three roots y. 

Example. For y* — 22/— 1 = 0, we have 

n« = 8/3, 0083 x= V27/32, 3x = 23°17'0", 

cos X = 0.99084, cos (x + 120°) = -0.61237, cos (x + 240°) = -0.37847, 

y = 1.61804, - 1, - 0.61804. 

EXERCISESt 

Solve by the last method 

1. 2/'-7y + 7 = 0. 2. x»-f3x«-2x-5 = 0. 

3. x» + x2-2x-l = 0. 4. x» + 4x«-7 = 0. 

5. The cubic for ^ in § 6 has three real roots; in just three of the nine sets of 
solutions X, y, both are real. 



CHAPTER IV 
Algebraic Solution of Quartic Equations 

1. Ferrari's Method. Writing the quartic equation 

(1) tt^ + b3? + cx^ + dx + e = 

in the equivalent form 

(x" + ibxy = iiV " c)x' - dx - e 

and adding (x^ + i bx)y + \y^ to each member, we get 

(2) (x' + hbx + hyy ^ {\¥ - c + y)^? + (iby - d)x + i y* -- e. 

We seek a value yi of y such that the second member of (2) shall be the 
square df a linear function of x. For brevity, write 

(3) 62 _ 4c + 42/1 = ^. 

We here assume that t t^ {qf. Exs. 3, 4, p. 40). We therefore desire 
that 

(4) \e2? + (ibyi-d)x + iyi'-e^ {^^ + ^^\^^ '' 
The condition for this is that the terms free of x be equal: 

(5) ivr^-e^ (Jbyi-dy 
^^^ *^' ^ 6«-4c + 4i/i 

Hence yi must be a root of the resolvent cubic equation 

(6) y» - C2/* + (6d - 4 e)2/ - 6^6 + 4 cc - cP = 0. 

After finding (Ch. Ill) a root t/i of this cubic equation, we can easily 
get the roots of the quartic equation. In view of (2) and (4), each root 
of the quartic equation satisfies one of the quadratic equations 

^^ \^ + \Q> + t)x + hyx + (i hyx - d)/t = 0. 

38 
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EXERCISES 

1. Forx* + 2x» - 12 x« - lOx + 3 = 0, show that (6) becomes 

y* + 12 y* — 32 2/ — 256 = 0, with the root yi= —4, and that (7) then become 

x« + 4x-l=0, x»-2x-3 = 0, 

with the roots -2 db V5; 3, —1. 

-2. Solve x*-2x»-7x« + 8x + 12 = 0. 
3. Solve X* - Sx* + 9x2 + 8x- 10 = 0. 

2. Relations between the Roots and Coefficients. Let Xi and xz be 
the roots of the first quadratic equation (7), Xz and Xa those of the second. 
The sum and product of the roots of a^ + Ix + m = are —I and m 
respectively (Ch. II, § 16, or Ch. VI, § 1). Hence 



(8) 



xi-hxt= -i(b-t), XiXt = iyi- {^byi-d)/t, 
,Xz + Xa=^ -H& + 0, ^'^^ = § 2/1 + (i &yi -d)/t. 



Using also (5), we find at once that 
(9) xi + X2 + Xz + X4=^ -6, X1X2X8X4 = i 2/1^ - (i 2/1* -e) = c, 

(10) XiX2 + XiXz + XiXA + 3C2Xz + X2Xi+XsPCi = XiX2+{Xi+X2) (X8+X4) +XaX4 = C, 

(11) X1X2XZ + X1X2X4 + XiXiXi + XsX8X4 = 3:1X2 (Xs + X4) + XsX4(Xi + X2) = — d. 

It follows from Ex. 3, p. 40 that (9)-(ll) hold also when there is no root 
2/1 for which t9^0. 

For any quariic equaiion (1), the sum of the roots is —bj the sum of the 
frroducts of the roots two at a time is c, the sum of the prodixts three at a time 
is —d, the product of all four is e. 

A proof based upon more fundamental principles is given in Ch. VI, § 1. 

3. Roots of the Resolvent Cubic Equation. These are 
(12) yi = X1X2 + X3X4, 2/2 = XiXi + X2a:4, Vz = X1X4 + x^. 

The first relation follows from (8). If, instead of 2/1, another root of (6) 
be employed as in § 1, quadratic equations different from (7) are ob- 
tained, such however that their four roots are Xi, X2, X3, X4, paired in a new 
way. This leads us to expect that 2/2 and i/s in (12) are the remaining 
roots of cubic (6). To give a formal proof, note that, by (9)-(ll), 
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(13) 



2/1 + 2/2 + 2/8 = c, 

!/il/2+!/a/t+2/22/t = (a:i+iC2 + Xz + Xi) (xiXjXs H h XiXz;t^ — 4 XiXiPC^Xi 

= M-4e, 

2/0/22/1 = (aJiXjXs +•••)* + a;irr2a:sX4K^i + • • 0^ - 4 (xiX2+ •••)!' 
= (P + e(62-4c). 



Hence by Ex. 7, p. 32, or by Ch. VI, §1, 2/i> 2/2> 2/» are the roots of (6). 

EXERCISES 

1. Why is it sufl&cient for the last proof to verify merely the first two relations 
(13)? 

^-2. In Lagrange's solution of quartic (1), we begin by showing that the num- 
bers (12) are the roots of cubic (6) by using (13) and the theorem of § 2. Let a 
root 2/1 be found. Then we obtain 0:1X2 = Zi and 2:3X4 = ztas the roots of 2* — yiz 
+ e = 0. Next, Xi + xj and xj + X4 are found from 

(xi -f- Xt) + (xi + X4) = - 6, 2j(xi + xi) + zi(xj + X4) = -d. 

Hence Xi and xj, Xj and X4 are found by sohdng quadratic equations. Give the 
details of this work. 

"3. If the t corresponding to each root of (6) is zero, equation (1) has all its 
roots equal. For, by (3), the t/'s all equal c — J 6*. By (13), 3 yi = c, 3 yi* = 
W — 4 c. Hence c = 1 6*, A ^* = W — 4 e. Eliminating e between the latter 
and (i 6*)' = t/i' = 6*e — 4 cc + cP, which follows from 2/1 = c — J 6* and (13), we 
get (^K 6» - d)' = 0. Then (1) equals (x + i 6)*= 0. 

4. Prove that Ex. 3 is true by showing that ^ = (xi + xj — xs — X4)'. 

5. Solve x* + px + ^ = (p ?^ 0) by choosing c so that the quartic 

(x - c)(x' + px + g) = 

shall have as its resolvent cubic (6) one reducible to the form «• = constant. Here 
(6) is 

2/* - P2/* + c(cp + 3q)y - c^^ - 2cpq - ^ + c*q = 0. 

To remove the second term, set y = 2 + p/3. We get 

«» + Az + c»g - i c*^p* - cpg - g« - iV P* = 0, 

where A == pc^ + Scq — Jp*. We are to make A = 0; thus 

ipc^-iq + VR, R = t+^, 

«»=: -q(c» + cp + q) + SR = (^y/Rji- \q+ Vr), 
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since c* + cp + 5 = 36 cR/p*. Our quartic has the root c and hence by (81), 
with b replaced by — c, also the root i(c + 1) — c, where <' = c'--4p + 4 2/. 
Hence the given cubic has the root 



i(«~c)= V^-Jp + ic^-ic 
which may be reduced to Cardan's form {Amer, Math, Monthly, 1898, p. 38). 

4. Discriminants. Replacing y hy Y + c/3 in (6) , we get 

(14) P + PF + Q = 0, 
in which 

(15) P = 6d-4e-Jc2^ Q ^ -b^e + lbcd + ^ce-cP - ^ c^. • 
Hence (Ch. Ill, § 3), 

(2/1 - 2/2)2(2/1 - 2/»)'(2/2 - 2/3)' = -4P« - 27 Q2. 
By (12) 

yi - 2/2 = (^1 - a;4)(x2 - xz)y 

(16) yi - 2/8 = (a^i - ^z)(^ - icO, 

2/2 - 2/8 = (ici - a^)(aJ8 - 0:4). 

The discriminant A of the quartic (1) is defined to be 

(17) A = (xi - X2)'(a:i - 0:3)^X1 - x^y{x2 - Xzy{x2-x^y{Xi-x;)*. 
It therefore equals the discriminant of (14) : 

(18) A = -4P3-27(32. 

Any quartic equation and its reaobent cubic have equal discriminants. 

Some writers define the discriminant of (1) to be A/256 and that of a cubic to 
be —A/27. In suppressing these numerical factors, we have spared the reader 
a feat of memory, simplified the important relation between the discriminants of 
a quartic equation and its resolvent cubic, and moreover secured uniformity with 
most of the books to which we shall have occasion to refer the reader. Finally, 
we note that in applications to the theory of numbers, the insertion of the numer- 
ical factors is imdesirable and in special cases unallowable {of, BuU. Amer. Math. 
80c., vol. 13, 1906, p. 1). ^ 

EXERCISES 
1. For ax* + 6x' + cx* + (ix + e = 0, P = p/a*, Q = g/a', where 
p = W — 4ae — c*/3, q = —6*6 + ibcd + lace — ad^ — ^c*. 
The discriminant is defined to be a* A; it equals —4 p' — 27^. 
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2. If X and y are interchanged in 

/ = ax* + 6x*j/ + ex V + <^3:y* + ^> 

a function is obtained which may also be derived from / by merely interchanging 
a with By and & with d. Show that the latter interchanges leave p, 9 and the dis- 
criminant unaltered. 

3. Since the sum Yi + Y2+ Yz of the roots of a reduced cubic is zero, 

71 = 1(1^1 - Yt) + HYi- r,), . . . , 

and any root and hence any function of the roots is expressible as a function of the 
differences of the roots. Thus P and Q in (15) are functions of Yi — 72, etc., 
and hence of 2/1 — Vh etc. Using (16), show that p and q equal polynomials in 
the differences of Xi, . . . , X4. 

4. When x is replaced by x + ty^ let / of Ex. 2 become 

/' = a'x* + h'xhf + • • • + e V. 
Show by Ex. 3 that p and q equal the corresponding functions 

p' = b'd' - 4 a'e' - c'V3, q' = -VH' + • • • . 

5. The results in Exs. 2 and 4 are special cases (used in a short proof) of a gen- 
eral theorem: When x is replaced hy Ix + my and yhyrz + sy, let / become /'. 
Then, using the notations of Ex. 4, we have p' = D^Py q' = Z)^, where D = Is —mr. 
Hence p and q are called irwariards of /. Verify the theorem for the case when x 
is replaced by ix, y by y, 

6. The discriminant is an invariant and the factor is D^, 

7. Using oox* + 4 aix*j/ + 6 oaxV + 4 Ojxy* + (uy* in place of the former /, 
show that p = — 4 7, 9 = 16 /, where 

I = 0004 — 4 OiOs + 3 oj*, / = OoOKU + 2 OiOjOj — OoOi* — 01*04 — o«*. 
In (14) set 7 = 2 2/0; then «* — /« + 2 / = 0. The discriminant is 

256(7'-- 27/*). 

6. Descartes' Solution of the Quartic Equation. Replacing x by 
z — 6/4 in the general quartic (1), we obtain a reduced quartic equation 

(19) ' ^ + q)? + rz + s = 0, 

lacking the term with s?. We shall prove that we can express the left 
member of (19) as the product of two quadratic factors * 

(2* + 2kz + l){z^ - 2kz + m) = z* + (I + m - 4: k^)z^ + 2k{m- l)z + Im. 

* If the coefficients of z be denoted by k and —k (as is usually done), the expres- 
eions (23) for the roots must be divided by 2. But the identification with £uler*8 
solution is then not immediate. 
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The conditions are 

Z + m — 4 fc2 = g, 2 i; (w — = r, Im = 8. 

If k j^ 0, the first two give 

Then lm — % gives 

(20) 64 Jk« + 32gfc^ +^'4 (g^ - 4s)Jfc2 - r« = 0. 

The latter may be solved as a cubic equation for fc*. Any root A* ?^ 
gives a pair of quadratic factors of (19) : 

(21) 22jt2te + ig + 2fc2=Fj^- 

The 4 roots of these two quadratic functions are the 4 roots of (19). If 
g = r = s = 0, every root of (20) is zero and the discussion is not valid; but 
the quadratic factors are then evidently 2^, ^, 

EXERCISES 
^^1. For 2* - 3 2' + 6 2 - 2 = 0, (20) becomes 

64A;« - 3-32 A:^ + 447 ik« - 36 = 0. 

The value A;* = 1 gives the factors 2' + 2 2 — 1, 2* — 2 2 + 2, with the roots 

-1 =fc V2, 1 zfcV^. < ■ 

-"2. Solve 2< - 22^ - 82 - 3 = 0. ^^ ' < 

N3. Solve 2* - 102« - 2O2 - 16 =0. 
4. Solvex*--8x» + 9x' + 8x-10 = 0. 

6. Symmetrical Form of Descartes' Solution. To obtain this sym- 
metrical form, we use all three roots fci^, kf^ k^ of (20). Then 

k^ + fe'^ + ika' = - 1 g, fci Vfcs' = r»/64. 

It b at our choice as to which square root of fci* is denoted by +A;i and 
which by — fci, and likewise as to ±^2, ±^3. For our purposes any 
choice of these signs is suitable provided the choice give 

(22) fcifefcs = -r/8. 

Let k\ 5^ 0. The quadratic function (21) is zero for fc = fci if 
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Hence the four roots of the quartie equation (19) are 

(23) ki + k2 + h, ki — ki — h, —h + ki — kt, — fci — fc2+.fct. 
Writing k^ = i/, we see that, if yi, 1/2, yz are the roots of 

(24) 641/3 + 32^2/2 + 4 (52 - 45)2/ - r« = 0, 
then the roots of (19) are the four values 

(25) z=y/yi + Vy2 + Vyz, 

obtained by using all of the combinations of the square roots for which, 
by (22), 

(26) V^i V^ VrTs = -r/8. 

We have deduced Euler's solution (Ex. 1) from Descartes'. 

EXERCISES 

1. Assume with Euler that quartie (19) has a root of the form (25). Square 
(25), transpose the terms free of radicals, square again, and show that 

2* - 2 (t/i + 2/2 + 2/3) 2^ - 8z V^ V^ v^ -h (2/1 + 2/2 + yiY 

- 4 (2/12/2 + 2/12/j + 2/J2/3) = 0. 

From the relations obtained by identifying this with (19), show that 2/1, 2/«i Vi are 
the roots of the cubic (24) and tliat (2G) holds. 

2. Solve Exs. 1-4 of the preceding set by use of (23). 

3. In the theory of inflexion points of a plane cubic curve occurs the quartie 
equation z} — Sz- — \ Tz — i^ IS^ = 0. Show that (24) now becomes 

and that the roots of the quartie are 

where the signs are to bo chosen so tliat the product of the three summands equals 
+ r/6. Here w is an imaginary culx* root of unity. 

4. The discriminant A of the (juartic wiuation (19) equals the quotient of the 
discriminant D of (24) by 4*. For, the six differences of the roots (23) are 
2 (A'l =h A-i), 2 (A-i =h A's), 2 {kt ± k^). Thus A = 4« L, where 

L = (A'i2 - k2'm,^ - kmkz' - AV)- = (71 - 2/i)»(!/i - 2/.)'(!/2 - 2/i)*. 
By definition, D = W^L. Hence D = 4« a. 
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5. Give a second proof of Ex. 4 by setting y = 2/4 in (24) and then z =Y — 2 q/3. 
We obtain (14), in which now 6 = 0, c = g-, d = r, e = s. The discriminant of 
(14) equals A. Hence A = (21 — z^y . . . = 4*L = D/4*. 

"^6.- If a quartic equation has two pairs of conjugate imaginary roots, its dis- 
criminant A is positive. Hence, if A < 0, there are exactly two real roots. 

7. Theorem.* A quartic equation (19) with q, r, s, redly r j^ 0, and 
with the discriminant A, has 

4 distinct real roots if q and ^s — (f are negative and A > 0, 
no real root if q and 4 s — g^ are not both negative and A > 0, 
2 distinct real and 2 imaginary roots if A < 0, 
at least 2 equ^al real roots if A = 0. 

Since the constant term of the cubic equation (24) is negative, at least 
one of its roots is a positive real number. Let, therefore, yi > 0, so that 
yiUi > 0. Thus ki = Vyi is real. There are four possible eases to consider. 

(a) 2/2 and yz positive. Then each fcy = V^ is real and the roots (23) 
of the quartic equation are all real. 

(&) 2/2 = 2/s < 0- Then ^2 = rfcA^s is a pure imaginary. If fe = fcs, 
the first two roots (23) are imaginary and the last two are real and equal. 
If ^2 = — fcj, the reverse is true. 

(c) 2/2 and 2/8 distinct and negative. The roots (23) are all imaginary. 

(d) 2/2 and 2/3 conjugate imaginaries. Then A^ is imaginary and conju- 
gate with either fcj or — fca, so that one of the numbers ^2 + ^8 and ^2 — ^^3 is 
real and the other imaginary. Just two of the roots (23) are real. 

Now, if A = 0, at least two y's are equal by Ex. 4 of the last set. Thus 
we have case (6) or a special case of (a). In either case, the quartic has 
at least two equal roots, by (17), and they are real in both cases. 

Henceforth, let A ^ 0. By the same Ex. 4, A has the same sign as 
the discriminant D of the cubic equation (24). If A < 0, we have case 
(d). Finally, let A > 0, so that 2/i> 1/2, 2/3 are real. If q is negative and 
g* — 4 « is positive, equation (24) has alternately positive and negative 
coefficients and hence has no negative root, so that we have case (a). 
But if q and 4 « — g^ are not both negative, the coefficients are not alter- 
nately positive and negative, so that the roots yi, yi, yz are not all posi- 
tive,** and we have case (c). 

* Proved by Lagrange by use of the equation whose six roots are the squares of the 
differences of the roots of (19), R^olvUion dea iquationa numiriquea, 3d ed., p. 42. 
•♦ The coeflBioients are - (1/1 + yi + j/i), j/ij/j + ym + j/i^t, - yiytyi. 
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EXERCISES ; 

'1. Apply this theorem to the quartic equations in Exs. 1-4, p. 43. 

2. Verify that a quartic equation (19) with two pairs of equal imaginary roots 
has r = 0. Deduce the last case of the theorem. 

3. Why does the theorem imply its converse? 



t CHAPTER V 
The Fundamental Theorem op Algebra 

l.t Theorem. Every equation mth complex coefficients 

(1) f{z) = 2'* + aiz^-' + . . . + an = 

has a complex (real or imaginary) root. 

For n = 2, 3, or 4, we have proved this theorem by actually solving 
the equation. But for n = 5, the equation cannot in general be solved 
algebraically, i.e., in terms of radicals. 

We shall first treat the case in which all of the coeflScients are real. 
Relying upon geometrical intuition, we have seen in Exs. 3, 5, p. 14, 
that there is a real root if n is odd, or if both n is even and an is negative. 
But, as in the cases of certain quadratic equations and 2* + 2^ + 5 = 0, 
an equation of even degree may have no real root. No proof of the 
theorem for all cases has been made by such elementary methods. 

The proof here given of the theorem that any equation with real co- 
efficients has a complex root is essentially the first proof by Gauss (1799 
and simplified by him in 1849). 

We are to prove that there exists a complex number z = x -\- yi such 
that/(2i) = 0. We may write 

(2) Kz) = X + Yi, 

where X and Y are polynomials in x and y with real coefficients. We 
are to show that there exist real numbers x and y such that 

(3) X = 0, 7 = 0. 

For example, if /(«) = z* - 4 g^ + 9 2* - 16 2 + 20, then 

X = X* - 6xV + y* - 4a:» + I2xy^ + %x^ - %y^ - 16x + 20, 
i y = 2a:*2/ - 2x2/» - Ox^y + 2?/3 + 9xy - 8y. 

The graph of F = is the x-axis iy = 0) and the graph (indicated by the dotted 
curve in Fig. 16, asymptotic to the lines x = 1 and y = ± x) of 

2 (x - l)y« = 2a:» - 6x2 + 9x - 8. 

47 
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Note that there is no real y for x between 1 and 1.73. Smce X = is a quadratic 
equation in y^^ its graph is readily drawn. There is no real ytoTX = 0.05 and 1.6 
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Fig. 16 

and the intermediate values. Cases in which the values of y* are positive and 
rational are 



X 


-4 


-2 


- 1 





2 


3 


y' 


5,148 


Z,Of 04.0 


2,25 


4,5 


1,8 


1,26 



The graphs cross at the points (0, 2), (0, —2), (2, 1), (2, —1), and the roots of 
f{z) = are z = ±2 1, 2 ± i. 

We shall employ also the trigonometric form of z: 

(4) z = r(cos d + i sin d), 
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where ^ ^ < 2 ir. Set < = tan ^ ^. Then 



Thus 






Hence by (1) and (2), 

(l+£'HX+Yi)==r^{l+tiy^+air^-^{l+tiy^-'^{l+(^)+ • • • +an(l+t')\ 
Expanding the terms on the right by the binomial theorem, we get 

(5) X - ^^^) y - g(0 

where F(0 is a polynomial in t of degree 2 n, and G{i) a polynomial in < 
of degree less than 2 n, each with coefficiAits involving r integrally. 

Each point (a;, y), representing (Ch. II, § 8) a complex number 
z = x + yi having the modulus r, lies on the circle x^ + y^ = r^ with radius 
r and center at the origin of the rectangular coordinate system. To find 
the points on this circle for which X = or F = 0, we solve F{t) = or 
G(t) = (in which r is now a constant), and note that to each real root t 
corresponds a single real value of sin^ and a single real value of cos^, 
consistent with that of sin dy and hence a single point (x = r cos d, 
y = rsind). But an equation of degree 2n has at most 2n distinct 
roots (Ch. I, § 15). Since the degree of G{t) is less than that of the de- 
nominator of F in (5), the root i =oo of F = must be considered in 
addition to the roots of G{t) = already examined; for < =oo, d = ir and 
the point is (— r, 0). Thus neither X nor Y is zero for more than 2n 
points of the circle with center at the origin and a given radius r. By 
proper choice of r, this circle will have an arc lying within any given 
region of the plane. Hence neither X nor Y is zero at all points of a region 
of the plane. 

From (4) and DeMoivre's Theorem (Ch. II, § 10), we have 

2* = r* (cos kS + i sin kd). 
Hence, by (1) and (2), 

y = r" sin n^ + aif*-* sin (n - 1)^ + a^r""-^ sin (n — 2)^ + • • • + a^-i r sin 6. 
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Let g be the greatest of the numerical values of ai, . . . , a»-i. Then, if 

1 D I denotes the numerical value of the real number D, 

I - "^ 
r=r»(8inn^ + Z)), |Z)|^ (/(J + ~2+ ' ' ' +^) < (7 — ^ 

provided r > 1. If c is a positive constant < 1 and if r > 1 + g/c^ 
then \D\ < c. Hence for all angles 6 for which sin nB is numerically greater 
than c, Y has the same sign as its first term r" sin nB when r exceeds the 
constant 1 + g/c. 

In our example, we have 

Y = r*sin4^ — 47^8103 ^ + 9r'8in2 ^ — IGrsin^. 

The limit 1 + 16/c for r exceeds 17 and is larger than is convenient for a drawing. 
But for r ^ 10, 

4 9 16 
r =r* (sin 4 ^ + D), | i> | = - + - + ~ = 0.4 + 0.09 + 0.016. 

Taking c == 0.506 = sin 30^ 24', let C be the number of radians in 7** 36'. 

Thus c = sin 4 C. The positive angles e {e < 2 r) 
for which sin 4 ^ exceeds sin 4 C numericaUy are 
those between C and i x — C, between i x + C 
and i x — C, between i r + C and J r — C, . . . , 
between ix + C and 2 r — C For any such 
angle e and for r = 10, 7 has the same sign as 
sin 4 9 and hence is alternately positive and 
negative in these successive intervals, the solid 
arcs in Fig. 17. Denote by 0, 1, 2, . . . , 7 the 
points on the circle with center at the origin and 

radius 10 whose angles ^ are 0, j, — , . . . , — > 

respectively. 

In the general case, denote by 0, 1, 2, ... , 

2 n — 1 the points with the angles 




0^ ^-^ 



(2 n - 1) IT 



n 



on the circle with center at the origin and radius a constant r exceeding 
the above value 1 + g/c. Let nC be the positive angle < ir/2 for which 
sin nC = c. We define the neighborhood of our fcth point of division on 



in 
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the circle to be the arc bounded by the points whose angles are kir/n — C 
and kr/n + C, In Fig. 17 for our example with n = 4, each neighborhood 
is indicated by a dotted arc. In the successive arcs (marked by solid 
arcs) between the neighborhoods, Y is alternately positive and negative, 
since it has in each the same sign as sin ri^. 

It is easily seen that sin ^, sin 2 ^, . . . , sin n^ are continuous functions 
of (a fact presupposed in interpolating between values read from a 
table of sines). Since r is now a constant, Y is therefore a continuous 
function of 0, and has a single value for each value of d. But Y has oppo- 
site signs at the two ends of the neighborhood of any one of our points of 
division on the circle. Hence (as in Ch. I, § 12), Y is zero for some point 
within each neighborhood, and at just one such point, since Y was shown 
to vanish at not more than 2 n points of a circle with center at the origin. 
We shall denote the points on the circle at which Y is zero by 

iQy iiy . . . , 1 2n— !• 

For our example, these points Po, . . . , Py are given in Fig. 18, which shows 
more of the graph of F = than was given in Fig. 16, but now shows it with the 




scale of length reduced in the ratio 4 to 1 (to have a convenient circle of radius 10). 
We have shaded the regions in which, as next proved, Y is positive. 
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Let the constant r be chosen so large that X also has the same sign as 
its first term r" cos ndf for 6 not too near one of the values ir/(2 n), 3 ir/(2 w), 
5ir/(2w), . . . , for which cosn^ = 0. Since these values correspond to 
the middle points of the arcs (01), (12), . . . , no one of them lies in a 
neighborhood of a division point 0, 1, ... . Now cos nO = +1 or — 1 
when d is an even or an odd multiple of ir/^» respectively. Hence X is 
positive in the neighborhood of the division points 0, 2, 4, . . . , 2 n — 2 
and thus at Po, P2, P4, . . . , but negative in that of 1, 3, 5, . . . , 2 n — 1 
and thus at Pi, P3, Ps, . . . . 

We saw that Y is not zero throughout a region of the plane. Hence there 
is a region in which Y is everywhere positive (called a positive region), 
and perhaps regions in which Y is everywhere negative (called negative 
regions), while Y is zero on the boundary lines. 

In Fig. 18 for our example, there are three positive (shaded) regions, the two 
with a single point in common being considered distinct, and three negative (un- 
shaded) regions. Consider that part of the boundary of PtPja which lies inside 
the circle. At every point of it, Y is zero. Now X is negative at Pi and positive 
at Pi and hence is zero at some intermediate point a on this boundary. Hence 
at a both X and Y are zero, so that a represents a complex root (in fact, 2 i) of 

m = 0. 

To extend the last argument to the general case, let R h^ the part in- 
side our circle of a positive re^on having the points P2 h and P2 h+i on its 
boundary. The points of arc P2 kP2 k+i may be the only boundary points 
of R lying on the circle (as for P2P3a and PoPid in Fig. 18), or else its 
boundary includes at least another such arc P2 kPi *+i (as shaded region 
PiP^bP^PTC in Fig. 18). In the first case, X and Y are both zero at some 
point (a or d) on the inner boundary, since X is negative at P2M.1 and 
positive at P2 k and hence zero at an intermediate point. In the second 
case, a point moving from P2 a to P2 *+i along the smaller included arc and 
then along the inner boundary of R until it first returns to the circle 
arrives at a point P2* of even subscript (as in the case of PiP^bP^). In- 
deed, if a person travels as did the point, he will always have the region 
R at his left and hence will pass from P2 1 to P2 k+i and not vice versa. Since 
X is negative at P2*+i and positive at Ptk, it (as also Y) is zero at some 
point b on the part of the inner boundary of R joining these two points. 
Hence 6 represents a root of f(z) = 0. Thus in either of the two pos- 
sible cases, the equation has a root, real or imaginary. 



12] 



FUNDAMENTAL THEOREM OF ALGEBRA 



53 



2.t It remains to prove that an equation F(2) = 0, not all of whose co- 
efficients are real, has a complex root. By separating each imaginary coeffi- 
cient into its real and purely imaginary parts, we have F(z) = P + Qi, 
where P and Q are polynomials in z with real coefficients. Let G{z) = P — Qi. 

The equation 

F{z) . G(z) = P^ + Q^ = 

has real coefficients and hence has a complex root z = a + bi. If this is a 
root of F{z) = 0, our theorem is proved. If it is not, then G{a + bi) = 0. 
Then by Ch. II, § 16, F{a — bi) = 0, and the given equation has the root 
a — bi, 

EXERCISES 

l.t For 2* = 11 + 2 1, draw the graphs of X = 0, F = and locate the three 
roots of the cubic equation in z. 




Fig. 19 

2.t For 2* — 4z — 2 = 0, F = r* sin 5^ — 4r sin ^. Using polar coordinates, 
show that the graph of F = gives the boundaries of the regions in Fig. 19: first 
plot the horizontsd line corresponding to sin 9 = 0, and then, using various angles 
$ (0 5^ 0, t), find by logarithms the corresponding positive r from 

4 sin 9 



r* = 



em5e 
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« 
To find the points on these boundaries (F = 0) for which also 

Xer8cos5^-4rcos^-2 = 0, 
replace r* by the earlier expression. We get 

4 r(sm e cos 5 ^ — cos ^ sin 5 ^) = 2 sm 5 ^, r = — ;r-;: — r- • 

2 sin 4 

Comparing the fourth power of this fraction with that for r*, we get 

sin* 5 9 = 64 sin 9 sin* 4 ^, 

which holds for ^ = 85° 21' 30" or its negative. We then get r and therefore 

the roots 

€,« = 0.11679 =fc 1.4385 i. . 

On the horizontal line are three real roots, best found by methods of approximar- 
tion given later: 

a = 1.518512, & = -0.5084994, y = -1.2435964. 

(H. Weber and J. Wellstein, EncyMopddie der Elementar-Mathematikf ed. 1, I, 
p. 212, p. 296.) 

S.f Other References. For proofs of the fundamental theorem by Gauss, 
Cauchy and Gordan, see Netto, Vorlesungen uber Algebra^ I, p. 25, p. 173. The 
shortest proofs are by the use of the theory of functions of a complex variable, 
and may be found in texts on that subject. For an algebraic proof resting upon 
the theory of functions of a real variable, see Weber, Lehrhuch der Algebray 2d ed., 
vol. 1, pp. 119-142. See also Monographs on Topics of Modem Mathemalica, 1911, 
p. 201, edited by Young (article by Huntington). In the Amer, Math, Monthly, 
vol. 10 (1903), p. 159, Moritz has pointed out hidden assumptions in various in- 
complete proofs. 



CHAPTER VI 

Elementary Theorems on the Roots of an Equation 

1. Relations between the Roots and the Coefficients. Given an 
equation in x of degree n, we can divide its members by the coefficient of 
x" and obtain an equation of the fonn 

(1) /(a;) = a;'* + pix**"! + y^^"'^ + • • • + p^ = 0. 

By the fundamental theorem of algebra (Ch. V), it has a root ai, and 
its quotient by x — ai has a root a%^ etc. Thus 

(2) /(x) = (x - ai)(a: - ^2) • • • (a; - an), 

identically in x. Since the polynomial has n linear factors, each having 
one root, we shall say that the equation has n roots. These may not all 
be distinct; exactly m of them equal ai, if ai is a root of multiplicity m, 
i.e., if exactly m of the linear factors in (2) equal x — ai. Next, 

{x — ai) (a; — ^2) = a:^ — (ai + ai^x + axa^^ 

(x — ai) (a; — a%) (x — as) ^x* — (ai + a2 + a^oi? + (aia2 + aio^ + a2a3)a: — aia2a3. 

Thus f orn = 2 or 3, we see that the product (2) equals 

(3) x**— (ai + • • • + an)a;'»"^ + (aia2 + aias + a2a3 + • • • + an-iaOx**"* 

— (aia2a3+aia2a4+ • • • +an-.2an-ian)a:'*~^+ • • • +( — l)'*aia2 * - * an* 

Multiplying this by a; — a^+i, we readily verify that the product is a 
function which may be derived from (3) by changing n into n + 1. It 
therefore follows by mathematical induction that (2) and (3) are identical. 
Hence (1) and (3) are identical, so that 

ai + a2 + • • • + an = — pi, 
aia2 + aias + • • • + an-ian = P2, 

(4) aia52a8 + aia2a4 + • • • +an-2an-ia„ = — ps, 



aia2 • • • an-ian = (— l)'»Pn. 

For n = 3 and n = 4, the complete formulae were given and proved 
otherwise in Ex. 7, p. 32 and Ch. IV, § 2. 
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In an equation in x of degree n, in which the coefficient of x^ is unity , the 
sum of the roots equals the negative of the coeffi^cient of a;'*"^ the sum of the 
products of the roots two at a time equals the coeffi/n^nt of a;**"^, the sum of the 
products of the roots three at a time equals the negative of the coeffi^cierd of 
a;"""', etc.; finally y the product of the roots equals the coristant term or its nega» 
live according as n is even or odd. 

For example, in a cubic equation having the roots 2, 2, 5, the coeffi- 
cient of X equals 2- 2 + 2-5 + 2. 5 = 24. 

Given an equation Oox" + aix"""^ + . . . = 0, we first divide by Oo and 
then apply the theorem to the resulting equation. Thus the sum of the 
roots equals — ai/oo. 

EXERCISES 

1. Find the quartic equation having 2 and —2 as double roots. 

2. Find the remaining root in Exs. 1, 3, p. 9. 

3. If a real cubic equation a:* — 6 x* + • • • = has the root 1 + "v^— 5, 
what are the remaining roots? 

4. Form by the theorem the equations in Exs. 3, 4, p. 15. 

5. Given that a:^-2a:'-5x*-6x + 2 = has the root 2 - \/3, find 
another root and, by using the sum and product of the four roots, form the quad- 
ratic equation for the remaining two roots (avoid division). 

6. Find, by use of (4), the roots of x* — 6 a:* + 13 x' — 12 a; + 4 = 0, given 
that it has two double roots. 

" 7. Solve x' — 3 x^ — 13 ^ + 15 = 0, with roots in arithmetical progression. 

8. Solve 4x' — 16x- — 9x + 36 = 0, one root being the negative of another. 

9. Solve x' — 9 x^ + 23 X — 15 = 0, one root being triple another. 

-10. Solve x' — 14 x^ — 84 X + 216 == 0, with roots in geometrical progression. 

11. Solve X* — 2 x* — 21 X- + 22 X + 40 = 0, with roots in arithmetical pro- 
gression. Denote them by c — 3 6, c — fe, c + 6, c + 3 6. 

12. Solve x<-6x5 + 12x2- 10x + 3 = 0, with a triple root. 
- 13. Find a necessary and sufficient condition that 

fix) sx^ + pix^ + p%x + ])i = 
shall have one root the negative of another. Note that 

(aj + ai){ai + 03) (ai + aj) 

is obtained by substituting x = — pi in (2). 

14. If for w = 4 the roots of (1) satisfy the relation aiat = 0104, then pi*p4 = Pi*. 
Note that (4) gives 

— P3 = aiOiioi + ofi) + cuai^ai + at) = —piaiofj. 

15. Wliat is the coefficient of //"""^ in the equation 2/" +•••=» whose roots 
are 01 — A, • • • , a^ — A, when the as are the roots of (1)? For what value of 
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h is this coefficient zero? Hence to remove the second term of an equation by 
replacing x by y + hf what value of h must we take? Check by the binomial 
theorem. 

16. Find the equation whose roots are the roots of a:^ — 6x^ + 4 = each 
diminished by 3. Remove the second term by transformation. 
M7. Prove the binomial theorem by taking the as all equal in (2) and (3) and 
counting the number of terms in each coefficient of (3). 

18. Using (1) and (2), show that 

(1 - «1«)(1 - a,») ••• (l-«n^) = (l+P2 + P4+ •••)'-(?! + P3 + P6+ " ')\ 
(1+«1«)(1 + «,»). . •(l + an2) = (l-p2 + p4 r+(Pl-p8 + P6- •••)'. 

19. Since rci, . . . , Xaj determined by relations (8) of Ch. IV, give the correct 
values of the sums (9)-(ll), they are the roots of the quartic equation. Why does 
this give a new solution of the quartic? 

20. Using Ex. 6, p. 32, make a similar argument for the cubic. 

2. Upper Limit to the Positive Roots. For an equation 

fix) = oox'* + aix^'^ + • • • + an= (oa 9^ 0) 

with real coefficients, we shall prove the 

Theorem. // Oo, ai, . . . , at-i are eocA^O, while a* < 0, and if G is 
the greaiest of the numerical values of the negative coefficients j each real root is 

less than 1 + VG/ao. 

For positive values of x, f(x) is numerically greater than or equal to 

aoX'» - (?(a:'»-* + x""-^"^ + - - - + x + 1) 

„ ^ f x^-^^ - 1\ a:'»-*+Mao(x* - x*-^ -Gl+G 
= «oX'»-G^ ^__^ j ^-^ 

But, if a; > 1, a:* — x^'^ = (a; — 1)*. Hence if a;= 1 + y/Qj^ 

ao(x* - a*-^ ^ G, f{x) j^ 0. 

3. Another Upper Limit to the Roots. // the numerical value of each 
negative coefficient be divided by the sum of all of the positive coefficients 
which precede it, the, greatest quotient so obtained when increased by unity 
gives an upper limit to the positive roots of the equation. 

If the coefficient of a;*" is positive, we replace x^ by 

(x - l)(af»-^ + af»-* + • • • + a: + 1) + 1. 
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The argument will be clearer if applied to a particular case: 

fix) = posfi - pix* + pzx^ + pzpi^ - PiX + Pb = 0, 

where each pi is positive. Then f{x) is the sum of the terms 
Pq{x — l)a:* + Pq{x - l)^^ _|_ j^(^j. _ i^^ _|_ j^(^j. _ i)^. _|_ ^^(j. « i) + p^ 

— Pix* P2{x - l)x2 + P2(a; — l)x + pi{x — 1) + pi 

jh(x - l)x + p3(x - 1) + ps 
— p4a: Pb. 

The sum of the terms in each column will be positive, if a; > 1 and 
po (x - 1) - pi > 0, (po + P2 + Pz)(x - 1) - p4 > 0, 

since only in the first and fourth columns is there a negative part. These 
inequalities both hold if 



Po Po + P2 + P3 

EXERCISES 

Apply the methods of both § 2 and § 3 to find an upper limit u to the roots of 

1. 4x*-8x* + 22j:' + 98x«-73i: + 5 = 0. By §2, m = 1 + 73/4. By § 3, 
u= 3, since 1+8/4 = 3, 1 + 73/124 < 3. _ 

2. a:* + 4i:*-7x2-40x + l = 0. By § 2, a = 1 + V^40 = 4.42. By§3,ii = 9. 
"3. X*- 5x3 + 7x2 - 8x + 1 = 0. 

^4. x7 + 3x«- 4x^ + 5x^-6x3 -7x2- 8 = 0. 

5. x7 + 2x5 + 4x^-8x2 -32 = 0. 

6. If A Is the greatest of the numerical values of ai, . . . , On, each root is 
less than 1 + A/gq. In the proof in § 2, set ^ = 1 and replace G by A. 

7. A lower limit to the negative roots of /(x) = may be found by applying 
the above theorems to/(— x) = 0. To obtain a lower limit to the positive roots 
consider /(I /x) = 0. 

8. Find a lower limit to the negative roots in Exs. 3, 4. 

9. Find a lower limit to the positive roots in Ex. 5. 

4. The Term "Divisor." In certain texts it is stated that the relation 
ai a2 . . . a,» = d: Pn in (4) implics that "every root of an equation is a divisor 
of the absolute tenn.'* This statement is either trivial or else is not always true. 
It is trivijil if it means merely that the absolute term can be divided by any root 
(that root being a complex numlxjr), \ielding a quotient which b a complex num- 
ber. For, in this sense division is always possible (except when the divisor is 
zero), and a root not zero is a divisor of any number whatever. The statement 
quotwl was certainly not meant in this trivial sense, with no special force. The 
only other sense, familiar to the reader, in which a constant is said to be a divisor 
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of another constant is the following: An integer r is a divisor of an integer p if 
p/r is an integery so that p = rq^ where q is an integer. For example, 4 is a divisor 
of 12, but not of 6. In this reasonable sense of the term divisor in such a connec- 
tion, the statement quoted becomes intelligible only when modified to read: every 
integral root of an equation with an integral absolute term is a divisor of that 
term. But this is not always true. The integral root 6 of x* — ^^ x + 4 = is 
not a divisor of 4; the root 2 of x^ — i x — 3 = is not a divisor of —3. The 
correct theorem is that next stated. 

6. Integral Roots. For an equation all of whose coefficients are inte- 
gerSf that of the highest power of the variable being unity, any integral root is 
a divisor of the constant term. 

In certain texts, we find a correct statement of this theorem, but an erroneous 
proof. When ai and Pn are integers and aia2 . . . ofn = d: Pn, it is falsely con- 
cluded that ai is a divisor of Pn- But 12 • 3 • i = 9 and 12 is not a divisor of 9. Also 
the examples at the end of § 4 show the falsity of this argument and, indeed, of 
any argument not making use of the hypothesis that all of the coefficients are 
int^ers. 

A correct proof is very easily given. Let d be an integral root of equa- 
tion (1), in which now pi, . . . , pn are all integers. Then 

(5) d'* + pid'*-^ + p2d'»-2 + • • • + Pn-id + Pn = 0. 

Since d obviously divides all of the terms preceding the last term, it must 
divide Pn- 

Hence if there be integral roots of an equation of the specified type, 
they may be found by testing in turn each positive and negative divisor 
d of the constant term p^. The most obvious test is to compute (by the 
abridgment in Ch. I, § 5) the value of f(d) and note whether or not this 
value is zero. We may shorten the work very much by various methods, 
and most by a combination of these methods. 

Evidently it is imnecessary to test a value of d beyond the limits of the 
positive and negative roots. 

6. Newton's Method for Integral Roots. Consider an equation (1) 
with integral coeflScients. Let d be an integral root. It is a divisor of pn 
and we may set 

Pn = dqn-l. 

By removing the factor d from each term of (5), we get 

d«-i + pid*^* + . . . + p^_2d + p„_i + g^i = 0. 
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The left member is divisible by d, and hence 

where g„-2 is an integer. Then 

d«-2 + Pid«-» + . . . + p„_3d + Pn-2 + qn-2 = 0, 

Pn-'2 + Qn-^l = dqn-Z, 

where g»-3 is an integer, etc. Conversely, if such a relation holds at 
each step and if, finally, 1 + go is zero, then d is a root, and the quotient 
of f(x) by a; — d is 

x"-^ - qix""-^ - q2X''-^ — ... - q^_^x - gn-i- 

Indeed, in the product of the latter by a; — rf, the coeflScient of a;**"* for 
i > is dqt-i — qt and this equals pt by our relations. 

Corollary. If d is an integral root of an equation f(x) = a:'* + • • • = 
with integral coefficients, the quotient of f{x) by a; — d is a polynomial 
with integral coeflScients. 

This process is a modification of s\Tithetic division (Ch. X, §4). 

Example. f{x) = x* — 9 x' + 24 x^ — 23 x + 15 = 0. Since e\idently there 
b no negative r6ot, and since 10 is au upper limit to the positive roots, we have 

For d = 3, the work is as 

15 



only to test the divisors 1, 3, 5 of 15. 


Now /(I) = 8. 


follows: 




1 -9 


24 -23 


-1 6 


-6 5 



- 3 18 - 18 

Here we have divided 15 by 3 and placed the quotient under — 23. Adding, wo 
get —18, whose quotient by 3 is added to 24, etc. Since the last sum is zero, 3 is 
a root. The quotient has as its coefficients the negatives of the numbers in the 
second line (see the first line below). We test this quotient for the root 5: 

1-6 6-5 

- 1 1 -JL 

0-5 5 

Hence 5 is a root and the quotient is x' — x + 1. The latter does not vanish for 
X = ±1, Hence 3 and 5 are the only integral roots and each is a simple root. 
If we had tested a divisor —3 or 15, not a root, a certain quotient would not be 
integral and the work would be stopped at tl&t point. 

7. Another Method. A divisor d is to be rejected if d — m is not a 
divisor of /(m), where m is any chosen integer. 
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For, if d is an integral root of f{x) = 0, 

fix) ^(x-d) Q{x), 

where Q(x) is a polynomial with integral coeflScients (§ 6, Cor.). Then 
f{m) = (m — d)g, where q is the integer Q(m). 

In the example of § 6, /(I) = 8 is not divisible by 14, so that 15 is not an integral 
root. 
Consider the new example 

fix) s x3 - 20x2 + 164a: - 400 = 0. 

There is no negative root and 20 is an upper limit to the roots. The positive 
divisors of 400 less than 20 are 1, 2, 4, 16, 5, 8, 10. The last three are excluded 
since /(I) = —255 is not divisible by 4, 7, or 9. Also 16 is excluded since /(2) = 
— 144 is not divisible by 14. Incidentally we have excluded the divisors 1 and 2. 
The remaining divisor 4 is seen to be a root either by Newton's method or by 
computing /(4). 

In case there are numerous divisors within the limits to the roots, it is 
usually better not to begin by listing all of the divisors to be tested. For, 
if a divisor is found to be a root, it is preferable to proceed with the quo- 
tient, as was done in the Example in § 6. 

EXERCISES 

Find all the integral roots of 

1. x3 - 10x2 + 27x- 18 = 0. 

2. x^-2x3-21x2 + 22x + 40 = 0. 

3. X* + 47x* + 423x3 + 140x2 + 1213x - 420 = 0. 

4. X* - 34x3 + 29 x2 + 212x- 300 = 0. 

8. Rational Roots. Any rational root of an equation with integral 
coefficients, that of the highest power of the variable being unity , is necessarily 
an integer. 

Let a/b be a root, where a and b are integers with no common divisor 
greater than unity. Set x = a/b in (1) and multiply the members of the 
resulting relation by 6**"^ We get 

a^ 

All of the terms after the first are integers. Hence b divides a**. Unless 
b = ±1, 6 has a prime factor which divides a" and hence also a, contrary 
to hypothesis. Thus a/b = ±a is an integral root. 
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The rational roots of any equation with rational coefficients can now 

be readily found. If I is the least common denominator of the fractional 

coefficients, we multiply the members of the equation by I and obtain an 

equation 

aoy"" + aiy"-^ + • • • + an = 0, 

where a©, . . . , a„ are integers. Multiply the left member by Oo""^ and set 
Ooy = X, We obtain an equation (1) >vith integral coefficients, that of 
x** being unity. To any rational root 2/1 of the equation in y corresponds 
a rational root Oot/i of (1), which must be an integer, in view of the theorem 
just proved. Hence we need only find all of the integral roots of the new 
equation (1) and divide them by Oo to get all of the rational roots y of the 
original equation. 

Frequently it is sufficient (and of course simpler) to set Ay = x, where k 
is a suitable integer less than Oq* 

EXERCISES 

Find all of the rational roots of 

1. 2/*- V!/' + 4-y'-402/ + 9 = 0. 

2. G7/^-ll!/2 + 62/- 1 = 0. 

3. 1081/3 -2702/2 -422/+ 1 =0. [Use ^ = 6.1 

4. 32?/' - 62/ - 1 = 0. [Use the least k.] 

Fonn the equation whose roots are the products of 6 by the roots of 

5. j2 - 2x - i = 0. G. x^- ix^- lx + i = 0. 



CHAPTER VII 

Symmetric Functions 

1. 2-polynomials ; Elementary Symmetric Functions. A polynomial in 
the independent variables xi,0C2,...,Xnis called symmetric in them if 
it is unaltered by the interchange of any two of the variables. For example, 

is a symmetric function of Xi, X2, Xz- The sum of the first three terms is 
denoted by Xxi^ and the sum of the last three by 3 Sxi. In general, if t 
is a product of powers of Xi, . . . , Xn, whose exponents are integers ^ 0, 
Zt denotes the sum of this term t and all of the distinct terms obtained 
from it by permutations of the variables. Since such a Z-polynomial 
Zt is unaltered by every permutation of the variables, it is unaltered in 
particular by the interchange of any two variables and hence is a sjrm- 
metric function. For example, if there are three variables a, fi, 7, 

Xofff^y = 0^/337 + ^oc^y + ofy^fi + y^o^fi + ff^^a + y^^ot. 

Just as in the case of the initial example, any symmetric polynomial is 
evidently a linear combination of 2-polynomials with constant coefficients. 
The S-polynomials, of the first degree in each variable, 

(1) ^1 = SXi, E2 = 2Xia<2, Ei = XXiX2Xzf . . . , ^n = 3:10:2 . . . Xnr-lXn 

are called the elementary symmetric functions of aJi, . . . , Xn» 

Frequently we shall employ the notation «!,...,«„ for the indepen- 
dent variables. By Ch. VI, § 1, ai, . . . , an are the roots of an equation 
of degree n, 

(2) fix) s x** + piX'*"^ + jhx'''^ • • • + Pn = 0, 

in which — pi, jh, — Pa, • • • , {~^)'^n equal the elementary symmetric 
functions of the roots. It is customary to make the 'latter statement 
also for an equation whose roots are not independent variables. 

63 
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But in the latter case it is preferable to say that — pi, pj, . . . equal the elemen- 
tary symmetric functions formed for the roots, thus indicating that we have in 
mind the values of certain functions of arbitrary variables Xi, . . . , Xn for Xi = ai, 
, , , , Xn= an. It may happen that the resulting polynomials in cei, ..., an are 
not sjinmetric in ai, . . . , «„. For example, if the three roots are a, /9, /9, we have 
— pi = a + 2 /9, pi = 2 a/3 + /3*, — ps = a$^, which are the values of Xi + xj + xs, 
etc., but are not themselves s>Tnmetric in a, /3, /9, being altered by the interchange 
of a and fi. 

However, this point will give no trouble in the exercises below, since the roots 
are given distinct notations and may, if it is desired, be regarded as independent 
variables. 

2. Products of 2-polynomials. It is a fundamental theorem that any 
symmetric polynomial in the roots is expressible rationally and integrally 
in terms of pi, P2, . . .. , Pn and the coeflScients of the symmetric poly- 
nomial. To prove this, it suffices to show that any S-polynomial is ex- 
pressible rationally and integrally in terms of the elementary synametric 
functions. Postponing the general proof, we shall now treat several special 
cases and assign others as exercises. 

Example 1. If a, /3, 7 are the roots of x* + px* + ^x + r = 0, 

p2 = (a + /3 + 7)^ = a^ + /32 + 7^ + 2 (a/3 + a7 + /37) = So* + 2 q, 
Sa2 = p2 _ 2 g, - pq= Za- 2a/3 = So^/J + 3 afiy, Sa*/3 = 3 r - p^, 
Zot^fiy = pr^ 2:02/32 = {XafiY - 2 a/372a = q^ - 2 pr. 

The student should carr>' out in detail the steps here indicated. 

Example 2. The student should learn how to express a product like Sa • So/J 
in Ex. 1 as a sum of Z-f unctions \vithout writing out their expansions, since the 
latter method is vcr>' laborious in general. To obtain the types of S-functions 
ill the product, it suffices to use a single term (called leader) of one factor, say a. 
Then if we use any tenn of wa/3 which contains a, we get a term of Zc^fi; while 
if we use any tenn not containing a (hence 0y in this example), we get a term 
a/37. It remains to find the coefficients of these S-functions ^a^/S and a/87. To 
get o2/3, we must take the tenn a of Za and the term a/S of 2a/3, so that Zc^fi has 
the coefficient unity. To get a/S7, we may take a or /S or 7 from 2a and the com- 
plementary factor ^7 or a7 or a/3, resixjctively, from 2a/8. Hence 

:i:a . i:a/3 = 2:a2/3 + 3 a/37. 

3 3 G 

As a check, we have marked under each 2 the number* of its terms. Then the 
total numl)er of terms is 3 X 3 = G -|- 3. 

* Found by the theory of combinations in Algebra, and not by writing out in full 
the S-functiona. 
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Example 3. To find the product of the Z-functions 

Sa/3, S = Sa2/3, 

of a, fif 7, 5, we use the leader a/9 of the first. To obtain the four types of 
Z-f unctions in the product, we first use a term of s containing both a and /3; 
second, a term of s containing o^ but not /3; third, a term with a but with neither 
a* nor /3; fourth, a term free of a and /3. The respective types are those in 

Sa/9 • Sa2/3 = 1 So^/S^ _^ 2 So^/J^ + 2 So^^y^ + 3 ^afiyH. 

6 12 12 12 12 4 

The coefficient of any S-function on the right is obtained by counting the num- 
ber of ways its leader can be expressed as a product of terms of the 2-functions 
on the left. 

The coefficient of c^fiy^ is 2 since we must take either afi or fiy from Sa/S (for, 
we miist take a or 7, since s does not have a term with two exponents equal to 2; 
while if we take 07, the complementary factor afiy is not in s). To obtain afiyH, 
we must take a term from s with 7* and a or /3 or 5, The first and second coefficients 
are evidently correct. 

EXERCISES 

If a, /S, 7, 5 are the roots of x* + p2^ + qx^ + rx + s = 0, find 

1. Sa*/32 [Square Sa/S.] 

2. Sa3/3. [Use Sa2 . 2a/8.] 

3. Sal [Square Sa^.] 

If a, /3, 7 are the roots oi x^ + px^ + qx + r = 0, find the cubic equation with 
the roots 

2 2 2 

4. a^, /3^, 7^ 5. a/9, a7, /37. 6. -> —1 -• 

a /S 7 

By multiplying Sxi by a suitable 2-f unction, express in terms of functions (1) 
7. Sxi^ (if n > 1). 8. Sxi^xz if (n > 2). 9. Sxi^xj (if n = 2). 

10. Sxi' (if n > 2). 11. Sxi^ (if n = 2). 12. Sxi^xaa;,. 

13. For equation (2) with yi > 4, show that 

Zai^a2a3a4 = — P1P4 + 5 ps, SaiWaa = 3 piPi — P2P3 — 5 ps. 

14. For equation (2) with n > 5, show that 

I^aiWazoii = p2Pi - 4 P1P6 + pe, ^ai^ai^a^^ = ps^ - 2 P2P4 + 2 piPs - 2 pe. 

3. Fundamental Theorem on Symmetric Functions. Any polynomial 
symmetric in Xi, . . . , Xn equals a polynomial in the elementary symmetric 
functions Ei, , , . , En of the x^s. 

The proof, illustrated in Exs. 1 and 2 of §4, tells us just what elemen- 
tary symmetric functions should be multiplied together in seeking the 
expression for a given symmetric polynomial in terms of the E^s and 
hence perfects the tentative method used in the earlier examples. 
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It suffices to prove the theorem for any homogeneous symmetric poly- 
nomial S, i,e.j one expressible as a sum of terms 

h = aa;i*»a^*i . . . x^f* 

of constant total degree fc = fci + fe + • • • + fcn in the x's. Evidently 
we may assume that no two terms of S have the same set of exponents 
fci, . . . , fcn (since such terms may be combined into a single one). We 
shall say that h is higher than the term bxi^ix^^ . . . Xn'* if ki > Zi, or if 
fci = Zi, ^2 > Z2, or if fci = Zi, ^2 = Z2, fcs > Z3, . . . , so that the first one of 
the diflferences fci — Zi, fe — Z2, fcs — Z3, . . . which is not zero is positive. 
If the highest term in another symmetric polynomial S' is 

h' = a'xf^'xj^' . . . Xn*"', 

and that of S is A, then the highest term in their product SS' is 

hh' = aa'a;i*»+*»' . . . x^*" "•"*•'. 

Indeed, suppose that SS' has a term, higher than hh'y 

(3) cxi'i+'i' . . . a;n'-+'-', 

which is either a product of terms 

t = hx^^ . . . Xn'-, t' = 6'Xi'»' . . . Xn*"' 

of S and S' respectively, or is a sum of such products. Since (3) is higher 
than hh\ the first one of the differences 

l + W - frl - fcl', . . . , Zn + Zn' - fcn - K' 

which is not zero is positive. But, either all of the differences Zi — fci, . . . , 
In— kn are zero or the first one which is not zero is negative, since h is 
either identical with t or is higher than L Likewise for the differences 
Z/ — fci', . . . , Z„' — An'. We therefore have a contradiction. 

It follows at once that the highest term in any product of homogeneous 
s>'mmetric polynomials is the product of their highest terms. Now the 
high(»st t^rms hi Ei, £2, £3, . . . , £», given by (1), are 

Xij XlJ^2) XiX^XZf . . . , XiXi . . . Xnj 

respectively. Hence the highest term in Ei^^Ez*^* . . . En^ is 



n • 
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We next prove that, in the above highest tenn h of S, 

/Ci ^~ IC2 ~~ n/3 • • • ^~ (vfi* 

For, if Ai < fe, the symmetric polynomial S would contain the term 

which is higher than h. If ^2 < h, S would contain the term 

higher than A, etc. 
By the above result, the highest term in 

is h. Hence Si = S — (r is a homogeneous symmetric polynomial of the 
same total degree /b as S and having a highest term hi not as high as h. 
As brfore, we form a product ai of the -B's whose highest term is this hi. 
Then S% = Si — ai is a homogeneous symmetric polynomial of total 
degree k and with a highest term A2 not as high as hi. We must finally 
reach a difference St — (ft which is identically zero. Indeed, there is 
only a finite number of products of powers of Xi, . . . , x^ of total degree A;. 
Among these are the parts h\ hi, h%, . . . of A, Ai, A2, . • . with the coeflS- 
cients suppressed. Since each A^ is not as high as /i»-i, the fe'. A/, ^2', . . .are 
all distinct. Hence there is only a finite number of A». Since St — at ^ 0, 

iS = (r + Si = (r + (ri + S2= • • • =0' + (ri + (r2+ • • • +o'<« 

Hence S is a polynomial in -Bi, £2, . . . , En* 

• 

4. At each step of the preceding process, we subtracted a product of 
the -B's multiplied by the coefiicient of the highest term of the earlier 
function. It follows that any symmetric polynomial equals a rational 
integral function^ with integral coefficients, of the elementary symmetric func- 
tions and the coeffixAerds of the given polynomial. 

Corollary. Any symmetric polynomial with integral coeffixnents can 
be expressed as a polynomial in the elementary symmetric functions with 
integral coefficients. 

Instances of this important Corollary are furnished by the results in 
all of our earlier examples and in those which follow. 
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Example 1. If /S = Sxi^xi^s and n > 4, we have 

<r = EtEz = /S + 3 2xi*xaa;jX4 +10 ^XiX^^^^, 
Si = iS — <r= —3 ^xficittpCi — 10 2x1X1X9X4X6, 
<ri = — 3 ^1^4 = — 3 (2xi*XaXjX4 + 5 2x1X3X9X4X6), 
S2 = iSi — <ri = 5 2x1X^X9X4X5 = 5 Eif 

iS = <r + /Si=<r + <ri + iS2 = EtEz — 3 ^1^4 + 5 E^, 

Example 2. U S = 2xi'x9X9 and n > 4, 

a = ^1*^9 = -fi^i (2xi*XaX9 + 4 2x1X3X9X4) 

= 2xi'xjX9 + 2 2xiWx9 + 3 2x1^x9X9X4 

+ 4 (2x12x9X9X4 + 5 2x1X9X9X4X5), 

iSi = iS — <r= —2 2xi2x2*X9 — 7 2x1^X9X9X4 — 20 2x1X1X9X4X5. 

Take <ri = — 2 E2EZ and proceed as in Ex. 1. 

Remark. The definition of a 2-polynomial in § 1 may be extended to 2J-fimc- 
tions in general. For instance if there are three variables a, fi, 7, 

^^a a p y ^^a a a p p y y 

EXERCISES 

If a, /9, 7, 5 are the roots of X* + px* -|- gx* + rx + s = 0, 

6. Find the sums in Exs. 1, 3, 4 from the sum, sum of the products two at a 
time, and sum of the squares of the roots of 

l+py + qy'-bry^-b si/ = 0, 
obtained by replacing xhy 1/y in the former quartic equation. 






a ^^ ot ^^ a 

12 4 
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10 



12. Prove that the degree in any single x of a homogeneous symmetric poly- 
nomial S is the total degree of the equal polynomial in the ^'s. Hints : First show 
that no term of S has an exponent > ^i, so that the degree of iS in any single z 
is ku Next, <r is of total degree h in the E's. Set hi = o'a;i*»' .... Then <n 
is of total degree A;i'( — A;i) in the E's and not every exponent in <ri equals the corre- 
sponding exponent in <r. Thus <r is not cancelled by <ri, <r2, . . . . 

13. Given a polynomial in the E^s of total degree d, show that the equal func- 
tion of the x^8 is of degree — d in any single root. 

6. Sums of Like Powers' of the Roots. If ai, . . . , an are the roots of 
(2), we write Si = 2ai, S2 = Sai^, and, in general, 

Sk — Sai* = ai* + a2* + • • • + ^n*- 
The factored form of (2) is 

(4) f{x) = (x — a^{x - a2) . . . (x - aj. 
In this identity in x, we may replace xhy x + h. Thus 

fix + A) s (x 4- A — ai)(x + A — a2) . . . (x + A — an). 

In the expansion of f{x + A) as a polynomial in A, the coeflScient of the 
first power of A is /'(x), by the definition of the first derivative of f{x) in 
Ch. I, § 4. In the right member, the coefficient of A is 

{x — a^{x — as) ... (x — a„)+ • • • 4-(x — ai)(x — a2) . . . (x — an-i). 

Here the first product equals /(x) -f- (x — ai), by (4), etc. Hence 

(5) /'(,)^J(£L + JW_+...+J(^. 

X — ai X — a2 X — an 

If a is any root of (2), /(a) = and 

fix) fix) — /(a) x** — a** , x**"^ — a**"^ , , x — a 

= = -— h Pi [-•••+ Vn-\ 

X — a X — a X — a x — a x — a 

= x'*"^ 4- ax'*"^ + a^x'*"' + • • • + PiCx'*"^ + ax'*"' + • • • ) 
.4- p2(x'»-' +•••)+•••, 

(6) -^^^^^ = x"-i + (a + pOx''-^ + (o^ 4- pia 4- P2)x'»-» 4- • • • 
X — a 

4- (a* 4- Pio*"^ 4- P2a*-* 4- • • • 4- p*-ia 4" pjb)x'*"*-^ 4" • • • . 
Taking a to be ai, . . . , an in turn and adding the results, we have by (5) 
fix) = nx""^ 4- (si 4- npi)x*~^ 4- (s2 4- PiSi 4- np2)x'*"* • • • 

+ (s* 4- Pis*-i 4- P2S*-« 4- • • • 4-p/b-iSi+npifc)x'»-*-i4- • • • . 
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By Ch. I., § 4, 

/'(a:) = na;"-^+(n-l)pia:"-2+(n-2)p2a:'»-3+ • • • +(n-A;)p*a:"-*-*+ • • •• 
Since the coeflScients of like powers of x are equal, we get 

(7) si + Pi = 0, S2 + piSi + 2 p2 = 0, . . . , 

8k + P\Sk-i + P2Sk-2 + • • • + Pk-iSi + fcp* = (fc = 1, 2, . . . , n - 1). 
We may therefore find in turn si, S2, . . . , Sn-ii 

(8) si = -pi, 82 = pi^ - 2 p2, ss = -pi^ + 3 P1P2 - 3 ps, . . . . 

To find Sn, replace x in (2) by «!,...,«» in turn and add the resulting 
equations. We get 

(9) 8n + p;8n-l + P2Sn-2 + ' * ' + pn-l^l + npn = 0. 

We may combine (7) and (9) into a single formula: 
(10) Sk + pi.s>-i + P28k-2 + • • • + Pife-iSi + kpk = (A; = 1, 2, ... , n). 

T(^ derive a formula which shall enable us to compute the Sk for fc > n, 
wo multiply (2) by x*"'*, take x = ai, . . . , x = «» in turn, and add the 
n»sulting equations. We get 

(11) Sk + PlSk-l + P25jt-2 + • • • + PnSjfc-n = (fc > m). 

Relations (10) and (11) are called Newton^s formulce. They enable us 
to express any Sk as a pol>Tiomial in pi, . . . , p^. 

EXERCISES 

1. For a cubic equation, «4 = Pi^ — 4 pi^j -h 4 pipi + 2 pj*. 

2. For an equation of dopree n ^ 4, Si = pi* — 4 7>i*/)j 4- 4 pipi + 2 pj* — 4 p4. 

3. If we define p„+i, p„+i, ... to be zero, relations (10) hold for every fc. 
Hence if pi, Pi, . . . are arl)itrar}' numbers unlimited in number, and if <ri, oi, . . . 
arc computed l)y use of 

<rk + Vm^l 4- • • • + Pfc-i<ri + fcp* (fc = 1,2,...), 

ck l)ecomes Sk when wo take p„+i = 0, p„+2 = 0, . . . . See Exs. 1 , 2. 

4. For X" — 1 = 0, »t = w or according; as fc is divisible or not by n. 

6. 2-f unctions Expressed in Terms of the Functions «*. We have 

8aSb = -ai* • 2ai* = 2ai«+* + m 2ai«a2*, 

(12) Sai^'Ofo* = - (SaSt - Sa+6), 

where m = 1 if a t"^ b, m = 2 if a = 6. 
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Any 2-f unction with a term involving just three roots may be denoted 
by SaiWoaS a ^ 6 ^ c > 0. If 6 > c, 

for m as above. Since a + 6>c, a + c>6, 

1 

(13) Sai«a2W = — (5oS6Sc — SaSb+c — SdSo+c — ScSo+6 + 2 Sa+M-c) (6 > c). 

7/1' 

But if 6 = c, we have 

SoSaiW = r 2ai«a2*a3* + Sai«+*a2*, 
where r = 1 if a > 6, r = 3 if a = 6. Hence 

(14) SaiWoa* = K«aS6^ — S0S26 — 2 S6Sa+* + 2 Sa+26) (a > 6), 

(15) 2ai«a2*a3« = t (So' - 3 S0S2 a + 2 S3 «) . 

The fact that any Z-polynomial can be expressed as a polynomial in the 
functions Sk is readily proved by induction. We have 

\ + • • • +^2aiW . . . ar""^^ 

where /is a positive integer, and ^i, . . . , t are integers ^ (for example, 
<r = if ^ = 6, since the terms which it multiplies are included in the sum 
multiplied by /i). Hence if every Sai*i . . . aA is expressible as a poly- 
nomial in the functions s*, the same is true of every Zafa^ . . . ar^i°. But 
the theorem is true for r = 1 (by the definition of «*). Hence it is true by 
induction for every r. 

EXERCISES 

1. Take a = 6 in (13) and then replace c by a. Hence (14) holds also when 
a <h. Derive this result just as we did (14). 

2. Express 2 ai^aJ^az^cU^ in terms of the s*, treating all cases. Why are these 
formuke unnecessary if the equation is of degree four? 

3. For a quartic equation express the functions 

Xai^aff Zai'as, Zai^a2as, Zai'as'os 

in terms of the 8jb and ultimately in terms of the pi, . . . , p4. 

7. Since any s* equals a polynomial in pi, . . . , pn(§ 5), the theorem 
of § 6 shows that any S-polynomial (and hence any rational integral 
symmetric function) of the roots of an equation equals a polynomial in 
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the coefficients pi, . . . , pn of the equation. Since we may form an equa- 
tion with arbitrarily assigned roots, we have a new proof of the funda- 
mental theorem on symmetric functions (§3). 

The method of §§ 5, 6 to express a S-polynomial in terms of the coeffi- 
cients is advantageous when a term of S involves only a few distinct 
roots, but with high exponents, while the method of §§ 2, 3 is preferable 
when a term of 2 involves a large number of roots >vith low exponents. 

8. Waring's Formula * for s,^ in Terms of the Coefficients. We shall 
first derive this formula by a very brief argument employing infinite 
series in a complex variable, and later give a longer but more elementary 
proof. 

In (2) and (4) replace z by l/y and multiply each by y". Thus 

(16) 1 + pi2/ + P2y^ + • • • + Pny"" =(1 - ai2/)(l - cx2y) . . . {I— any). 

Take the natural logarithm of each member, noting that the logarithm 
of a product equals the sum of the logarithms of the factors, and that 

logii - z) = -z - h ^ - ^ z^ - • • • - -2'- • • • = -5^;:^' 

if the absolute value of 2 is < 1. Hence 

00 



^-xW' 



if y is sufficiently small in absolute value to easure the convergence of 
each of the series used. The coefficient of i/* in (pit/ + • • • + Pny^'Y 
may be found by the multinomial theorem. Hence, after dividing r = n + 
• • • + r„ into the multinomial coefficient, we get 

(1/) S* =y -y— , —, Vi'^lh'' . . . P«S 

^^ ' I • ' 2 • • • • ' n • 

♦ Etlward Waring, Misc. Analyt.y 1762; ^feditationfs AlgebraiccB, 1770, p. 225, 3d ed., 
1782, pp. 1-4. No hint is given as to how Waring found (17); his proof was in effect 
by mathematical induction, being a verification that Skt sk-u . . . , 8i satisfy Newton's 
formukc. 

But (17) had been given earlier by Albert Girard, Invention nouveUe en Vcdgbbre 
Amsterdam, 1629. 
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where the sum extends over all sets of integers ri , . . . , fn, each ^ 0, for 
which 

(18) ri + 2r2 + 3r3 + • • • + nrn = fc. 
Here r! denotes 1 • 2 • 3 . . . r if r ^ 1, and unity if r = 0. 

I - 

V 9. Elementary Proof of Waring's Formula. Divide each member of 
(16) into the negative of its derivative; we get 

(19) - Pi - 2 Pay - - ' - npnV^-' _ ai ^ . . . + «- 



1 + Pi2/ + • • • + Pny" I - aiy 1 - any 

In the identity 

(20) Y^^i + Q + Q'+ • • • +Q*"'+r^' 

set Q = agy and multiply the resulting terms by ag. Hence the second 
member of (19) equals 

(21) si + S22/ + . . . + 5*2/*-i +_^^*(^L^^, 

1 + Pit/ + • • • + Pny"" 

the polynomial 4>(y) being introduced in bringing the fractional terms 

ai*"^V(l - «i2/)> 

etc., to the common denominator (16). 
In (20), we now set Q = — pii/ — • • • — pny"*. Thus 

i + ,,,+ .\.+,„,, - 1 (- iMPo/ + • • • + P»r)' + n^^.. 

where ^(y) is a polynomial. Expanding this rth power by the multino- 
mial theorem, we see that the left member of (19) equals 

^^ Ti! • • • Tni 

(d=Pi+2p2yH ), 

the sum extending over all integral values ^ of ri, r2, . . . , rn such 
that ri + • • • + Tn < A;, while -B is a fraction whose denominator is 
1 + Piy + • • • and whose numerator is the product of y* by a polynomial 
in y. In the expansion of the part preceding E, the terms with the factor y* 
may be combined with E after they are reduced to the same denominator 
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as E. The resulting expression* is now of the same general form as (21), 
so that the coefficient of y*"^ must equal «*. This coefficient is the sum of 

(n + 2r, + • • • +nrn = k— 1), 
(n + 2rj + • • • +nrn = k- 2), 

^^ ' 1 • • • • , ' n • 

(ri + 2r2+ • • • +nr» = fc-3), 



In the first sum employ the summation index ri + 1 instead of ri; in 
the second sum, r^ + 1 instead of r2; etc. We get 



where now (18) holds for each sum. Adding these sums, we evidently 
get the second member of (17). 

Example 1. Let n = 3, A; = 4. Then n + 2 rj -h 3 rs = 4 and 
(ri, ra, ra) = (4, 0, 0), (2,1,0), (1,0,1), (0,2,0), 

/V 2' 1' 1' \ 

** = H^i '"* - 2!T! '"""^ + rrii ''■^'' + 2] ^'V 

= pi^ - 4 pi'p2 + 4 pipj + 2 />2^ 

* The difference between it and (21) is an expression of the form (21). Suppose there- 
fore that an ejq^rcssion (21) is identically zero. Taking y = 0, we get ai * 0. The 
quotient by y is identically zero. Then 82 — 0, etc. 
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Example 2. Let n = 2 and write p for — pi, g for pt, r for rj. Then n = A; — 2r. 
If K is the largest mteger = k/2, the sum of the kih powers of the roots of 
x^ — px -{- q = \a 






^ I. *-i , A; (A: ~ 3) , , , fc(^-4)(fc-5) ^,_,^, . 
= p*-A;p*-*5H j72~*P ^ 1T273 ^ V+ • • • . 

10. t Certain Equations Solvable by Radicals. Regarding p as a vari- 
able and g as a constant, denote the polynomial in the preceding Ex. 2 
by F{p), The equation F{p)' == c, where c is an arbitrary constant^ can he 
solved by radicals. Indeed, if x is a particular root of x* — px + g = 0, the 
second root is q/Xj and 

This expression in x is therefore the result of replacing p by a; + q/x in 
F(p), as shown by the quadratic equation. Hence F(p) = c then becomes 

xk + fS\ =c, x2* - ex* + g* = 0. 
Solving this as a quadratic equation for x*, we get 



^-l^s/i-^' 



2-^ V 4 
Since the.product of these two expressions is g*, definite values 

can be chosen so that pe = q. Hence if € be a primitive kth root of unity, 
the 2fc values of x can be separated into pairs p€**, (re*"*" (m = 0, 1, . . . , 
A; — 1), such that the product of the two in a pair is pa = q. Now x + q/x 
is a value of p. Hence the k roots p of F{p) = c are 

pc*" + (r€*-»" (m = 0, 1, ...,&- 1). 

Thus F(p) = e can be solved by making the substitution 

For fc = 3, the equation is p' — 3 ^ = e and the present method be- 
comes that in Ch. Ill for solving a reduced cubic equation. 



76 THEORY OF EQUATIONS [Ch. vil 

EXERCISES 

l.t Solve DeMoivre's quintic p'^ — 5 qp^ + 5 q^p = c for p. 

2.t Solve p^ — 4 gp* + 2 g* = c for p by this method. 

3.t Write down a solvable equation of degree 7. Solve it. 

4.t Solve t/*^ + 10 2/» + 20 1/ + 31 = 0. 

11. Polynomials Symmetric in all but One of the Roots. If P is a 

polynomial in the roots of an equation fix) = of decree n and if P is sym- 
metric inn — \ of the roots, then P equals a polynomial in the remaining root 
and the coefficients of P and f{x). 

For example, P = 3 ai + aa* + 03^ +•••+ an* is such a polynomial and 

P = Sai* + 3 ai - ai^ = pi2 - 2p2 + 3 ai - ai^. 

If a is the remaining root, P is synmietric in all of the roots of the equa- 
tion (6) of degree n — 1, whose coefficients are polynomials in a, pi, . . . , 
Pn. Hence (§ 3) P equals a polynomial in a, pi, . . . , pn and the coeffi- 
cients of P. 

Example 1. If a, /S, 7 are the roots of /(x) a a:* + px* + qx -\- r = 0, find 

a-{-fi a + fi a + 7 /3 + 7 

Since /3* -h 7* = p2-2g-a*. /9-h7=-p-a, 

X 024-/32 ^ p2_2^-«« ^/ , 2q \ _ , _ ^ 1 

a-|-/3 ^^— p — a ^^\ cl^PI ^^ol -f p 

But a -H p, /3 4- P, 7 + P are the roots 2/1, 2/2, 2/3 of the cubic equation obtained from 
/(x) = by setting x + p = 2/, i.e., x = y — p. The resulting equation is 

7/3 - 2p2/2 + (/>' + g)l/ 4- r - pq = 0. 

Since we desire the sum of the reciprocals of ?/i, t/i, r/j, we set j/ = 1/z and find the 
sum of the roots 21, Ztj Zz of 

1 - 2p2 4- (p' 4- 9)2' 4- (r - p^)2» = 0. 
Hence 

S I ^ y 1 = Vz = Z'i±-?, y ^ + ^ ^ 2(7^ ~2p^y 4- 4 pr 
a4-p ^yi ^^ Pq — r' ^at4-/3 P9 — r 
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Example 2. If xi, . . . , Xn are the roots of f(x) = 0, find 



^xi4 



+ c 

First, Xi + c = yi, . . . , Xn + c = ^n are the roots of 

/(-c + 2/)=/(-c)+2//'(-c)+2/K)+ . . . =0. 
Next, 1/^1 = 2i, ... , l/^n = 2;n are the roots of 

2'y(- c) 4- e'^-y'C- c) 4- 2"-2( )+...= 0, 
obtained by setting y = 1/zin the preceding equation. Hence 



^xi + c ^ ' f{-c) 



EXERCISES 
[In Exs. 1-14, a, /8, 7 are the roots of /(x) = x* + px^ + ?x + r = 0.] 

1. Find V — ; — by means of the last formula. 

^a + p 
Using I3y + a{fi + y) = ?, find 

4. Why would the use of fiy = —r/a complicate Exs. 2, 3? Verify 

-r /(«) - r 
^7 = — = = a^ + pa + q. 

a a 

? 

6. Show that the last simi equals X{y/fi). 

7. Find 2j(/3 + 7)^. 8. Fmd 2J (a + /? - 7)'. 9. Find Jj fe^Y* 

10. Find a necessary and sufficient condition on the coefficients that the roots, 

112 —3 r 

in some order, shall be in harmonic progression. If - H — = -, then 6 = 0. 

a y 0' q 

and conversely. But 

11. Find the cubic equation with the roots 0y , ay , a0 . Hint: 

a y 

since these are (— r — l)/a, etc., make the substitution (— r — l)/x = y. 

Find the substitution which replaces the given cubic equation by one with the 
roots 
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12. a0 + ay, a/8 + /Sy, ay + fiy. 

13. — ; , etc. 14. — ; ;;— .etc. 

fi + y-a l5 + y-2a' 

If er, /8, 7, 5 are the roots of x* + px^ + gx* -|- rx + s = 0, find 

15. 2; ^ + ^ + 5* ^^- ^^ + 7 + 5-3' 

17. If i/i, 2/2, ys are the roots oi y^ + py + q = 0, the equation with the roots 
zi = (2/2 - 2/8)^ «2 = (2/1 - 2/j)S 28 = (yi - 2/2)^ is 

23 + 6p2;2 4. 9p22 + 4ps ^ 2752 = 0. 

Hints: since Zi = Z?/i' —2 yiyi — yi^ = — 2 p + 2 q/yi — 2/1*, etc., we set « = 
— 2 p + 2 5/2/ — 2/2. By the given equation, y^ + p -\- q/y = 0. Thus the 
desired substitution is 2 = — p + 3 g/?/, y = Sq/(z -{- p), 

18. Hence find the discriminant of the reduced cubic equation. 

12. t Cauchy's Method for Symmetric Functions. If Xi, . . . , Zn are 
the roots of (2), any polynomial P in Xi, . . . , x„ can be expressed as a 
polynomial in X2, . . . , Xn, pi, . . . , p^ in ever>' term of which the expo- 
nent of Xo is less than 2, the exponent of X3 less than 3, . . . , the exponent 
of Xn less than n. To this end, we first eliminate Xi by using Sxi = — pi. 
Then we eliminate X2*(fc ^ 2) by using the quadratic equation satisfied by 
X2 and having as coefficients polynomials in Xs, . . . , x^. This quad- 
ratic may be obtained by dividing /(x) by (x — Xs) . . . (x — Xn), or b}- 

noting that 

ari + X2 = — pi — X3 — • • • — Xn, 

X1X2 = P2 — (Xi + X2) (X3 + • • • + Xn) — X3X4 — ... — X»_iX„. 

Next, we eliminate X8*(fc ^ 3) by using the cubic equation obtained by 
dividing /(x) by (x — X4) . . . (x — Xn). Finally, we eliminate Xn*(fc = n) 
by using /(x») = 0. 

Example. To compute by this method the discriminant 

A = (xi - Xi)*(xi - xs)-(xf - XiY 

of f{x) a x* -|- px + 9 = 0, we note that Xi and Xi are the roots of 

-^^ = <?(x) « x« + XX, + x,» + p = 0. 
X — x« 

Since Sxi = 0, 

(xi - X,)* = (-2x2 - x,)« = 4Q(x,) - 3x,« - 4p= -3x,« - 4p, 

(x, - x,)(xj - Xf) = QCxs) = 3 x,« + P, 

A = (-3x,» - 4 p)(3x,» + p)» = -27(x,» + px,)' - 4p», 

A= -27g»-4p'. 
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We can now easily prove the fundamental theorem of § 3: if P is sym- 
metric in Xi, . . . , Xnt it equals a polynomial in pi, . . . , Pn- For, 
P = A + Bx2, where neither A nor B involves Zi or Xz. Since P is unal- 
tered when Xi and 0C2 are interchanged, 

A +3x2 = A + Bxi. 

UxiT^ X2j then B = 0; and, by continuity ,"B = even when Xi = Xj. Hence 

P = C + Dxz + Exz^ 

where C, D, E do not involve Xi, X2 or X3. Since P is unaltered when Xz 
and xi are interchanged, or when X3 and Xa are interchanged, the equation 

= C -P + Dy + Ey^ 

has the three roots Xi, X2, X3. Hence if Xi, X2, X3 are distinct, D = S = 0, 
P = Cy and by continuity these relations hold also if two or all three of 
these x's are equal, so that P is free of Xi, X2, X3. Similarly, P can be re- 
duced to a form which is free of each x»-. 

^ 13. t Tschimhausen Transformation. We can eliminate x between (2) 
and 

(22) X = uo + UiX + V/^ + . •. . -f u^_ix'*~^ 



and obtain an equation in X of degree n. First, from the expressions for 
X^, -y, . . . , we eliminate x**, x'*"'"^ ... by use of (2) and get 



(23) 



-X"^ = 1^0 + W21X H- U^7? + 



• • • 



+ Ui n-lX'^-S 



X" = tinO + t^nlX + Unl?? + 



... 



+ tin n-lX'^-S 

where the Wt-/ are polynomials in lio, . . . , Un^x and the coefficients of (2). 
In any one of these equations (23) we set x = Xi, . . . , x = Xn in turn 
and add the resulting relations. If Xi, . . . , Xn are the values of X for 
X ^ Xi, • • • y. X ^ Xn) set 

s* = Sxi*, & = SXi*. 
Then 

Si = niio + WiSi + V^% + • • • + Wn-lSn-l, 

S2 = ntiao -h ti2iSi + W22S2 + • ' • + t^ »-iSn-i, 



(24) 



Sn = nWnO + WnlSl + Wn2«2 + 



• • • 



+ Un n-lSn-l. 
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Since the elementary symmetric functions of Xi, . . . , Xn are expres- 
sible in terms of Si, S2, . . . , S» (§ 6), we can find the coefficients of the 
equation having the roots Xi, . . . , Xn'- 

(25) X- + PiX»-i + PiX--^ + . . . + Pn = 0. 

Another method of forming this equation is given in Ch. XII, § 9. 

If we seek values of i^o, Ui, . . . , Wn-i, such that Pi, P2, . . . , P* shall 
all vanish and therefore Si = S2 = • • • = S* = 0, by Newton's identities 
(7), we have only to satisfy a system of k equations [see (24)] homogeneous 
in lA), . . . , Un^i and of degrees 1, 2, . . . , fc, respectively. In partic- 
ular, Si = enables us to express Uo in terms of wi, . . . , so that 

(26) ^ = ^i(^-7i)+^(^-5)+ * • • +''"-^('^'*"'"^} 
Example. For n = 3, X = Ui{z — J «i) + utix^ — i «f), 

S2 = 2X1* = Ui'iSt - i 81*) + 2 UMSI - i «iS2) + wi*(«« - i «2*). 

Thus S2 = gives 

(3 Si - 81^) Ui = (si«2 - 3 «8 + V-3 A)u2, 

A s «oS284 + 2 «i«2Sj — SqSs' — «l'S4 — 82*. 

Hence the cubic equation is reduced to X* + Ps = by the substitution 
X = (sist - 3 «, + V-3 a) (3 X - «i) + (3 «2 - «i»)(3 x* - S2). 
By Ex. 6, p. 158, A is the discriminant of the cubic equation. 

EXERCISES 

l.t For n = 4, take mj = in (26) and find the cubic equation for M1/U2 which 
results from Ps = (i.e., Sz = 0, since Si = 0). The new quartic equation 
X* + P2-Y^ -|- P4 = may be solved in terms of square roots. 

2.t For n = 5, the condition for & = is that a certain quadratic form q in 
Wi, . . . , W4 shall vanish. Now q can be expressed as a sum of the squares of 
four linear functions L/ of ui, . . . , M4. Taking Li = 1L2, Lj = 1X4, where 
i* = — 1, we have S2 = 0. By means of the resulting two linear relations between 
wi, . . . , M4, we may express S3 as a cubic function of t/i, 1/2, for example. We 
must therefore solve a cubic equation in ui/ut to find the u's making also Sz = 0. 

The new quintic equation is X* + P4X + Pj = 0. If P4 ?^ 0, set X » y v^. 
Then y* + y + c =» 0. (Bring, 1786; Jerrard, 1834.) 



CHAPTER VIII 

Reciprocal Equations. Construction op Regular Polygons. 

Trisection op an Angle 

1. For certain types of equations, such as reciprocal and binomial 
equations, there exist simple relations between the roots, and these relations 
materially simplify the discussion of the equations. 

An equation is called a reciprocal equaiion if the reciprocal of each root 
is also a root. Apart from possible roots 1 and —1, each of which is its 
own reciprocal, the roots are in pairs reciprocals of each other. 

For example, the equation 

/(x) = (a;-l)(x2-}x+l)=0 

is a reciprocal equation having the roots 1, 2, i. If we replace x by 1/x and multi- 
ply the resulting function by x', we get — /(x). Here (1) holds f orn = 3 and for 
the minus sign. 

In general, if 

/(x) = x** + • • • + c = 

is a reciprocal equation, no root is zero, so that c 5^ 0. If r is any root of 
/(x) = 0, 1/r is a root of /(1/x) = 0, and hence of 



'■'©■ 



1 + • • • + ex** = 0. 



Since the former is a reciprocal equation, it has the root 1/r. Hence 
any root of the former equation is a root of the new equation. Thus, by 

(1) and (2) of Ch. VI, the left member of the latter is the product of /(x) 
by c. Then, by the constant terms, 1 = c*. Hence c = ±1 and 

(1) X-/Q ^ ±f{x). 

Thus if Pi x**""* is a term of /(x), also =t p^x* is a term. Hence 

(2) /(x) = x*' ± 1+ pi(x«-i it x) + P2(x«-* it X*) + • • • . 

81 
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If n is oddy n = 2 < + 1, the final term is 

and X it 1 is a factor of /(x). In view of (1), the quotient 

Qix) ^ /(^> 



.n— 1 



Xit 1 

has the property that 

Hence Q(x) = is a reciprocal equation of the type 

(3) X2' + 1 + Ci(x2'-1 + X) + C2(x2'-2 + X») + ' ' ' + C|X* = 0. 

Indeed, the highest power x^' of x has the coefficient unity and the con- 
stant term is unity, so that it is of the form (2) with the upper signs. 

If n is everij n = 2 /, and if the upper sign holds in (1), we have just seen 
that (2) is of the form (3). Next, let the lower sign hold in (1). Then 
Pt = 0, since a term ptx^ would imply a term —pix^. The final term in 

(2) is therefore 

pt-i(x'+i - x'-i). 

Hence /(x) has the factor x^ — 1. As before, the quotient is of the form (3). 
In each case we have been led to a reciprocal equation of type (3). 
The solution of the latter may be reduced to the solution of an equation of 
degree t and certain quadratic equati^yns. To prove this, divide the terms 
of (3) by X'. Then 

(4) (x' + 1) + c. (x-i + ^,) + r.(x'- + ^,) 

+ • • • +ct-i\x+ -j + ct=0. 

To reduce this to an equation of degree <, we set 

(5) x+ - = z. 
Then 

3:^ + ^ = ^-2, x» + ^ = 2»-32, . . . , 

while the general binomial in (4) can Ik? computed from 

(6) ^ + ^ = .(^-. + _l3.,)_(^-. + ^). 
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For example, 

X* + ^ = zCg' - 3 2) - (2« - 2) = 2^ - 42^ + 2. 

However, we can obtain an explicit expression for x* + 1/x* by noting 
that it is the sum of the fcth powers of the roots x, 1/x of 

y^ -'lz + -jy + a: • - = 2/2 - 21/ + 1 = 0. 

The sum of the fcth powers of the roots ol y^ — py + q ^ was found 
in Ex. 2, p. 75. Taking p = Zj g = 1, we have 

(7) x* + l = 2* - fc2*-* + ^^^^2*-4 _ fcL^(^IL5)^_,^ . . . 

,, k(,k-r-l)(k-r-2) . . ■ (fc-2r + l) ^_,. . 
+ (-1) 1.2.3. ..r «»2r+.... 

Hence (4) becomes an equation of degree t in 2. From each root 2 we 
obtain two roots x of (3), which are reciprocals of each other, by solving 
the quadratic equation x^ — 2x + 1 = 0, equivalent to (5). 

Example. Solve x*— Sx^ + Qa:* — 9x* + 5x — 1 = 0. Dividing by x — 1, 
we get X* - 4x3 + 5x2 - 4x + 1 = 0. Thus 



P-('-i) 



x* + -, -4lx + -J + 5 = 0, 2«-.42 + 3 = 0, 2 = lor 3. 

For 2=1, x«-x + l = 0, x=i(l±V^^. For 2 = 3, x*-3x + l = 0, 
X = J (3 d= V5). These with x = 1 give the five roots. 

EXERCISES 

Solve by radicals the reciprocal equations 

1. x«-7x* + x'-x2 + 7x-l = 0. 2. x*^=l. 

3. X* = 1. 4. x*^ + 1 = 0. 

5. Find the 2-cubic for x^ = 1. 

6. Find the 2-quintic for x^^ = 1. 

7. The 2-quartic f or X* = lis2* + 2»-322-22 + l = 0. It has the root -1 
since the 2-equation f or x* = 1 is 2 + 1 = 0. Verify that, on removing the factor 
2 + 1 from the quartic, we get the 2-cubic 2* — 3 2 + 1 = for (x* — l)/(x' — 1) = 0. 

8. What are the trigonometric representations of the roots of the 2-equations in 
Exs. 5 and 6? Hint: if x = cos ^ + t sin ^, 1/x = cos ^ — t sin ^. 
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2. Binomial Reciprocal Equations. A reciprocal equation with only 
two terms is of the form x** ifc 1 = 0. Its roots were expressed in terms 
of trigonometric fimctions in Ch. II. But now we wish to use only alge- 
braic methods.* We might proceed as in § 1, first ** removing the factor 
X ifc 1 (if n is odd) or x^ — 1 (if n is even and the lower sign holds), and 
then applying substitution (5) to obtain the 2J-equation. Except for special 
values of n (as those in Exs. 2-6, § 1), there is a more effective method, 
leading to auxiliary equations of lower degree than the 2-equation. For in- 
stance, it will be shown that x^' — 1 = can be solved in terms of square 
roots; it is only a waste of effort to form the ^-equation of degree 8. 

3. The new method will first be illustrated for x^ — 1 = since it then 
differs only in form from the earlier method of treating reciprocal equations. 
Removing the factor x — 1, we have 

(8) x« + x* + x^ + x' + x2 + x + l=0. 

If r is a particular root of (8), its six roots are (Ch. II, § 13), 

(9) r, r^, r^j r^, r^, r*. 

By the substitution (5), we obtain the cubic equation 

(10) 2» + 2*-22- 1 = 0, 

whose roots are therefore 

(11) zi = r + -=r + r^, 2, = r^ +;i = r« + r*, 2i = r» + ^ = r» + r*. 

The new method consists in starting with these sums of pairs of the 
six roots and forming the cubic equation having these sums as its roots. 
Since r is a root of (8), 

S2i = r + r« + . . . + r« = -1, XziZi = 2 (r + • • • + r«) = -2, 

2^12228 = 2 + r+ • • • +r* = l. 

Hence Zu Znt z^ are the roots of (10). If a root Zi be foimd, we can obtain 
r from the quadratic equation r* — 2ir + 1 =0. 

* It is an important fact, not proved or used here, that x* ± 1 = is solvable by 
radicals, namely, by a finite number of applications of the operation extrcuUian of a 
single root of a knoiim number. Cf. Dickson, Introduction to the Theory of Algthraic 
Equations^ John Wiley & Sons, pp. 77, 78. Note that it suffices to treat the case n 
prime, since x**^*" A is equivalent to the chain of equations y* = A, x** =■ y. 

** If n - p^, we may remove the factors x** d= 1 if p is odd. See Ex. 7, §1. 
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We can, however, find r by solving first a quadratic equation and after- 
wards a cubic equation. To this end, set 

(12) 2/1 = r + r2 + r^, 2/2 = r^ + r« + K 

Then 

2/1 + 2/2 = -1, 2/12/2 = 3 + r + . . . + r« = 2, 

so that yi and yi are the roots of 

y^ + y + 2 = 0. 

Then r, r^, r* are seen to be the roots of 

p' — 2/iP^ + 2/2P - 1 = 0. 

4. t The Periods. We now explain the principle discovered by Gauss 
by which we select the pairs from (9) to form the periods Zi, 22, 2:3 in (11), 
and the triples to form the periods 2/1, 2/2 in (12). To this end we seek an 
integer g such that the six roots (9) can be arranged in the order 

(13) r, r^, ro\ r^, r*^, r^^ 

each term being the fifth power of its predecessor. The choice g = 2 is 
not permissible, since the fourth term would then be r*= r. But we may 
take g = 3, and the desired order is 

(14) r, r', r^, r^ r^, r^, 

each term being the cube of its predecessor. To form the two periods 
j/i and 2/2, each of three terms, we take alternate terms of (14). To form 
the three periods 21, 22, 23, each of two terms, we take any one of the first 
three terms (as r^) and the third term after it (then r^), 

6.t Solution of x^'^ = 1 by Square Roots. Let r be a root 7^ 1. 

Then 

7.17 _ 1 

i -i = ri« + ri5 + • . • + r + 1 = 0. 

r — 1 

As in § 4, we may take g == 3 and arrange the roots, r, . . . , r^* so that 
each is the cube of its predecessor: 

r, r^, r», r^^, r^^, r^, r^\ r", r^\ r", r», r\ r*, r", r\ r\ y '1 ^ 

Taking alternate terms, we form the 2 periods each of 8 terms: 

2/1 = r + r' + r" + r^^ + r^* + r* + r* + r*, 

y^=,r* + r^^ + r^ + r^^ + r^* + r'^ + r^ + r^. 
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Hence 2/1 + 1/2= —1. We find that 1/12/2 = 4 (r + • • • +r")= —4. Thus 

(15) 2/1, 2/2 satisfy 2/^ + 2/ - 4 = 0. 
Taking alternate terms in 2/1, we form the two periods 

^1 = r + r^' + ri« + r*, 22 = r» + r« + r» + r*. 

Taking alternate terms in 2/2, we form the two periods 

u;i = r> + r** + r" + r^^, tx?2 = ri<> + r*^^ + r' + r^. 

Thus Zi + Z2 = Vh Wi + W2=^ 2/2' We find that 21^2 = Wiio^ = — 1. 
Hence 

(16) ziy 1^ satisfy 2^ — jfiz — 1 = 0, 

(17) Wiy W2 satisfy ic^ — y2W —1 = 0. 
Taking alternate terms in Zi, we have the periods 

Vi = r + r", ^2 = r^* + r*. 
Now, 1^1 + ^2 = zij vit^ = ti?i. Hence 

(18) vij tH satisfy t^ — Ziv +Wi= 0, 

(19) r, ri« satisfy p? - vip + 1 = 0. 

Hence we can find r by solving a series of quadratic equations. Which 
of the sixteen values of r we shall thus obtain depends upon which root 
of (15) is called 2/1 and which 2/2, and similarly in (16)-(19). We shall now 
show what choice is to be made in each such case in order that we shall 
finally get the value of the particular root 

r = cos-jy H-ism-jy 

Then 

1 2t . . 2t , 1 ^ 2t 

- = cos 7=- — ismr^r; ri = r + - = 2 cos -pr f 

T- 17 17 T 17 

J OTT . . Oir J I 1 e\. O IT 

r* = cos^ + ism^» i;j = r* +- = 2cos^- 

Hence vi > vj > 0, and therefore Zi > 0. Similarly, 

t/?i = r» + j5+r* + ^ = 2cos jy + 2cos-jy- = 2cos^-2co8^> 0, 

^ 6t , rt lOir , -. ' 12 IT , ^ Htt . ^ 
1/2 = 2 cosyy + 2 cos-^ + 2 cos -p^ + 2 cos-j=- < 0, 



Id 
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Bmce only the first oosine in i/z is positive and it is numerically less than 
the third. But y^ = -4. Hence i/, > 0. Thus (15)-(17) pve 

Ih = J (Wf - l), 9! = J (- vl7 - l), 

2i = § j/i + Vi +■ i j/i", wi = J »s + vT+TJ^- 

We now have the coefficients of (18) and know that ni > ki > 0, These 
results are sufficient for the next problem. Of course, we could go on 
and obtfun the explicit expression for Vi and that for r in terms of square 
roots. 

6.t Coostniction of a Regular Polygon of 17 Sides. In a circle of 
radius unity, construct two perpendicular diameters AB, CD, and draw 
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tangents at A, D, which intersect at S (Fig. 20). Find the point E in 
j4.S for which AE = J AS, by means of two bisections. Then 

AE = i, OE-i Vu. 
Let the circle with center E and radius OE cut AS at F and F'. Then 

AF - EF - EA - OE - i - iyi, 

Ar-EF'+EA^OE + i- -is,, 

OF . VoaF+aP - VT+Ts?. OF' - VI + i J,,'. 
Let the circle with center F and radius FO cut AS at H, outside of F'F; 
that with center F' and radius F'O cut AS aX H' Ijetween F' and F. Then 

AH = AF + FH - AF + OF - J », + Vl + in," = z,, 

XH'- F'H' - F'A - OF' - AF' - i»,. 
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It remains to construct the roots of equation (18). This will be done 
as in Ch. I, § 16. Draw HTQ parallel to AO and intersecting OC pro- 
duced at T. Make TQ = AH\ Draw a circle having as diameter the 
line BQ joining B = (0, 1) with Q = (21, Wi), The abscissas ON and OM 
of the intersections of this circle with the x-axis OT are the roots of (18). 
Hence the larger root Vi is OM = 2 cos 2 ir/17. 

Let the perpendicular bisector LP of OM cut the initial circle of unit 
radius at P. Then 



cos LOP = OL = cos 



17 



LOP= 



17 



Hence the chord CP is a side of the inscribed regular polygon of 17 sides, 
constructed with ruler and compasses. 



MB 



EXERCISES 

5, ^ = 2, the periods are r + r^^ r^ + r^. Show that they are the 

roots of the 2-quadratic obtained in Ex. 2, p. 83. 

2.t For n = 13, find the least g, form the three 

periods each of four terms, and find the cubic having 

them as roots. 

3. For n = 5, Ex. 1 gives r -|- r* = 2 cos 2 t/5 = 

i (V5 — 1). In a circle of radius unity and center 
draw two perpendicular diameters AOA'j BOB\ 
With the middle point M of OA' as center and radius 
MB draw a circle cutting OA at C (Fig. 21). Show 
that OC and BC arc the sides Sio and Sh of the 
inscribed regular decagon and p>entagon respectively. 
Hints: 

JV5, OC =i(V5-l), i?C= Vl + OC»= JV10-2V5, 




«io = 2sin 18° = 2 cos - = OC, 

o 



8j* = (28m36°)» = 2^ - cos^] = J (lO - 2 Vs), Sj = BC. 

7. t Regular Polygon of n Sides. If n be a prime such that n — 1 is 
a power 2* of 2 (as is the case when n = 3, 5, 17), the n — 1 imaginary nth 
roots of unity can l>e separat<?d into 2 sots each of 2*~^ roots, each of these 
sets subdivided into 2 sets each of 2*"* roots, etc., until we reach the 
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sets r, 1/r and r^, l/r^, etc., and in fact * in such a manner that we have a 
series of quadratic equations, the coefficients of any one of which depend 
only upon the roots of quadratic equations preceding it in the series. 
Note that this was the case for n = 17 (§ 5) and for n = 5. It is in this 
manner that it can be proved that the roots of x" = 1 can be found in 
terms of square roots, so that a regular polygon of n sides can be inscribed 
by ruler and compasses, provided n be a prime of the form 2* + 1. 

If n be a product of distinct primes of this form, or 2* times such a prod- 
uct (for example, n = 45, 30 or 6), or if n = 2*" (m > 1), it follows readily 
that we can inscribe by ruler and compasses a regular polygon of n sides. 
But this is impossible for other values of n. This impossibility will be 
proved forn = 7 and n = 9, the method of proof being appUcable to the 
general case. 

8. Regular Polygons of 7 and 9 Sides; Trisection of an Angle. For 

brevity we shall occasionally use the term " construct " for " construct 
by ruler and compasses." If it were possible to construct a regular poly- 
gon of 7 sides and hence angle 2 ir/7, we could construct a line of length 

2 cos 2 ir/7, the base of a right-angled triangle whose hypotenuse is of 
length 2 and one of whose acute angles is 2 7r/7. Set 

27r , . . 27r 
r = cos-=- + I sm — • 

Then 

1 27r . . 27r , 1 ^ 27r 

- = cos -= 1 sm -- > r + - = 2 cos -=- • 

r 7 7 r 7 

Hence 2 cos 2ir/7 is a root of the cubic equation (10). This equation has 
no rational root. For, if it had a rational root, it would have (Ch. VI, 
§8, §5) an integral root which is a divisor of the constant term —1, 
whereas neither + 1 nor — 1 is a root. Hence we shall know that it is im- 
possible to construct a regular polygon of 7 sides by ruler and compasses 
as soon as we have proved (§ 10) the next theorem. 

* See the author's article *' Constructions with ruler and compasses; regular poly- 
gons," in Monographs on Topics of Modem McUhemcUicSj edited by J. W. A. Young, 
Longmans, Green and Co., New York, 1911, p. 374. In addition to the references there 
given (p. 386), mention should be made of the book by Klein, Elementarmathematik vom 
Hoheren Standpunkte ausj Leipzig, 1908, vol. 1, p. 125; and cd. 2, 1911. 
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Theorem. It is not possible to construct by ruler and compasses a line 
whose length is a root of a cvbic equation with rational coefficients but having 
no rational root. 

This theorem shows also that it is not possible to construct a regular 
polygon of 9 sides and hence that it is not possible to construct the angle 
40° by ruler and compasses. Indeed, if r = cos 40° + i sin 40°, then 

r + 1/r = 2 cos 40° is a root (Ex. 7, p. 83) of 

2»-32+l=0. 

The same equation follows also from the identity 

cos 3 A = 4 cos^ A — 3 cos A 

by taking A = 40°, replacing cos 120° by its value —J, and setting 
2 = 2 cos 40°. Since neither divisor 1 nor — 1 of the constant term is a 
root of the 2-cubic, there is no rational root. 

Corollary. It is not possible to trisect every angle by ruler and compasses. 

Indeed, angle 40° cannot be constructed, while angle 120° can be. 

9. Duplication of a Cube. Another famous problem of antiquity was 
the construction of a cube whose volume shall be double that of a given 
cube. Take the edge of the given cube as the unit of length and denote 
by z the length of an edge of the required cube. Then x* — 2 = 0. 
Since no one of the divisors of 2 is a root of this cubic equation, the theorem 
stated in § 8 implies the impossibility of the duplication of a cube by 
ruler and compasses. 

10. t Cubic Equations with a Constructible Root It remains to prove 
the theorem in § 8 from which we have drawn such important conclusions. 
Suppose that 

(20) 2? + ax^ + Px + y = (a, /3, y rational) 

is a cubic equation having a root Xi such that a line of length Xi or — Xi 
can be constructed by ruler and compasses. We shall prove that one of 
the roots of (20) is rational. 

The construction is in effect the determination of various points as the 
intersections of auxiliary straight lines and circles. Choose rectangular 
axes of coordinates. The coordinates of the intersection of two straight 
lines are rational fimctions of the coefficients of the equations of the two 
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lines. To obtain the coordinates of the intersection of the straight line 
y = mx + b with the circle 

(x - p)2 + (y - g)2 = r^, 

we eliminate y and obtain a quadratic equation for x. Thus Xj and hence 
also 2/, involves no irrationaUty (besides irrationalities in m, 6, p, g, r) other 
than a square root. Finally, the intersections of two circles are given 
by the intersections of one of them with their common chord, so that this 
case reduces to the preceding. Hence the coordinates of the various points 
located by the construction, and therefore also the length db Xi of the seg- 
ment joining two of them, are found by a finite number of rational 
operations and extractions of real square roots, performed upon rational 
numbers and numbers obtained by earlier ones of these operations. 

If Xi is rational, (20) has a rational root as desired. Henceforth, let Xi 
be irrational. Then Xi is the quotient of two sums of terms, each term 
being a rational number or a rational multiple of a square root. A term 
may involve superimposed radicals as 

r = VlO - 2 V5, 8 = VlO + 2 V5, t = \/4 - 2 Vs. 

But t equals VS — 1 and would be replaced by that simpler value. As a 
matter of fact, r is not expressible rationally * in terms of a finite number 
of square roots of rational numbers, and is said to be a radical of 
order 2. A term having n superimposed radicals is of (}r(f^ ^ ^^ '^ '*» n^t 
expressible rational ly in tfrma nf radipAla each with fewer than n supe r- 
imposed radical s. In case Xi = 2 r — 7 s, we would express Xi in the form 
2 r — 28 V5/r, involving a single radical of order 2; indeed, rs = 4 Vs. 
If Xi involves VS, Vs and Vl5, we replace Vl5 by Vs • Vs. 

We may therefore assume that no one of the radicals of highest order 
n in Xi is a rational fimction with rational coefficients of the remaining 
radicals of order n and radicals of lower order, that no one of the radicals 
of order n — 1 is a rational function of the remaining radicals of order 
n — 1 and radicals of lower order, etc. 

Let Vk be a radical of highest order n in Xi. Then 

a + bVk 

Xi = y=t 

c + dVk 

* That is, as a rational integral function with rational coefficients. 
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where a, . . . , d do not involve Vk, but may involve other radicals of 
order n. If d 9^ 0, Vk 9^ c/d, in view of the preceding assumption. 
Thus we may multiply the numerator and denominator of Xi by c — d Vk. 
Hence, whether d 5*^ or d = 0, we have 

Xi = e+fVk (/5^0), 

where neither e nor / involves Vk. Since Xi is a root of (20), we have 
A + B Vk = 0, where A and B are polynomials in e, /, k, a, /3, 7. li B 9^ 0, 
we could express Vk as a rational function —A/B of the remaining 
radicals in the initial Xi. Hence B = and therefore A = 0. But the 
result of substituting e —f Vk for x in the cubic function (20) is evidently 
A — B Vk, Hence 

X2 = e — f Vk 

is a new root of our cubic equation. The third root is 

X8= — a — Xi — a?2= — a — 2 c. 

Now a is rational. If e is rational, Xs is a rational root of (20), as desired. 
The remaining case is readily excluded. For, if e is irrational, let Vi be 
one of the radicals of highest order in e. Then, as above, 

X3 = fif + A Vs (A 5^ 0), 

where neither g nor h involves Vs, while g — A Vi is a root 3^ Xs of (20), 
and hence identical with Xi or Xg. Thus 

e±fVk = g-hVs. 

Now Vs and all the radicals appearing in g. A, a occur in Xs and hence 
in e. But Vk is not expressible in terms of the remaining radicals of Xi. 
We have now proved that if the constructiblp root Xi of (20) is irra- 
tional, there is a rational root X3. 

11. t Problems such as the trisection of any angle can often be solved 
by means of certain curves. We note, however, that there exists no plane 
curve, other than a conic section, whose intersections by an arbitrary 
straight line can be found by ruler and compasses.* 

* J. Petersen, Algebraische GleichungeTif p. 169. 



CHAPTER IX 

Isolation op the Real Roots of an Equation with Real 

Coefficients 

1. Method of RoUe.* There is at least one real root of f'{x) = 6e- 
tween two consecutive real roots a and b of f(x) = 0. 

For, the graph of y = f{x) has a bend point between a and b. 

Corollary. Between two consecutive real roots r and s of f\x) = 0, 
lies at most one real root of f{x) = 0. 

For, if there were two such real roots a and b of the latter equation, the 
first theorem shows that/'(a:) = would have a real root between a and b 
and hence between r and s, contrary to hypothesis. 

Now /(x) = has a real root between r and s if /(r) and /(«) have oppo- 
site signs (Ch. I, § 12). Hence the Corollary gives the 

Criterion. If rand s are consecutive real root s off\x) = 0, thenf(x) = 
hxis a single real root between r and s if and only if f{r) and f(s) have opposite 
signs. At most one real root of f(x) = is greater than the greatest real root 
off{x) = Oj or less than the least real root of f'{x) = 0. 

The final statement follows at once from the first theorem. 

Example. For/(x) = Sx* - 25x3 + 60x - 20, 

^f\x) = X* - 5 x2 + 4 = (x« - l)(x2 - 4). 
Hence the roots of /'(x) = are =tl, ±2. Now 

/(-«)=-«), /(-2)=-36, /(-l)=-58, /(1) = 18, /(2)=-4, /(+oo)=+«). 
Hence there is a single real root in each of the intervals 

(-1,1), (1,2), (2, +00), 
and two imaginary roots. The 3 real roots are positive. 

2. The first theorem of § 1 is a special case of 

RoUe's Theorem. Between two consecutive roots a and b of f{x) = 0, 
there is an odd number of real roots of f\x) = 0, a root of mvUiplicUy m 
being counted as m roots. 

♦ TraUi de Valgbbre, Paris, 1690. Hudde knew the method in 1659. 
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We may argue geometrically, noting that there is an odd number of 
bend points between a and 6, the abscissa of each being a root of /'(x) = 
of odd multiplicity, while the abscissa of an inflexion point with a hori- 
zontal tangent is a root oi f\x) = of even multiplicity. 

To give an algebraic proof, let 

fix) ^{x- aYix - bYQix), a<b, 

where Q{x) is a polynomial divisible by neither x — a nor a5 — 6. Then 

^^ 'y = r(x - 6) + s(x - a) + (x - a)(x - &)^^- 

The second member has the value r{a — 6) < for x = a and the value 
sQ) — a) > for X = 6, and hence vanishes an odd number of times be- 
tween a and b (Ch. I, § 12). But, in the left member, (x — a)(x — b) and 
/(x) remain of constant sign between a and 6, since /(x) = has no root 
between a and 6. Hence /'(x) vanishes an odd number of times. 

Corollary. If /(x) = has only real roots, f'{x) = has only real 
roots distributed as follows: an (m — l)-fold root equal to each m-fold 
root of /(x) = for rn ^ 2; a single root, which is a simple root, between 
two consecutive roots of /(x) = 0. 

For, if the roots of /(x) = are a, 6, c, . . . , arranged in ascending 
order, of multiplicities r, s, <, . . . , respectively, then a, 6, c, . . . are 
roots of /'(x) = of multiplicities r — 1, s — 1, < — 1, . . . , and between 
a and 6 lies at least one real root of /'(x) = 0, etc. The number of these 
roots oi f'{x) = is thus at least 
(r-l) + l + (s-l) + l+(<-l)+ • • • =r + 8 + t+ • • . -l=n-l, 

if n is the degree of /. But /' is of degree n — 1 and hence has only these 
roots. Thus only one of its roots lies between a and b. 

EXERCISES 

1. x* — 5x + 2 = has 1 negative, 2 positive and 2 imaginary roots. 

2. x^ + j — 1=0 has 1 negative, 1 positive and 4 imaginary roots. 

3. x* — 3j^ + 2j* — 5 = has two imaginary roots, and a real root in each 
of the inter\'als (-2, -1.5), (-1.5,- 1), (1, 2). 

4. fix) =4x* — 3x^ — 2x* + 4x — 10 = has a single real root. I£nt: 

Fix) = l/'(x) = 5x^-3x»-x+l=0 

has no real root, since F'(x) = has a single real root and for it F is positive. 

5. If /^*Hx) = has imaginar>' roots, /(x) = has imaginary roots. 

6. If f'(x) = has exactly r real roots, the number of real roots of fix) = is 
r + 1 or is less tlian r + 1 by an even number, a root of multiplicity m being 
counted as m roots. 
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3. Sturm's Method. Let f(x) = be the given equation with real 
coefficients, and f{x) the first derivative of /(x). The first step of the 
usual process for seeking the greatest common divisor of f(x) and f{x) 
consists in dividing / by /' until we obtain a remainder r(x), whose degree 
is less than that of /'. Then, if ^i is the quotient, we have / = gi/' + r. 
We write /2 = — r, divide /' by /2, and denote by /s the remainder with its 
sign changed. Thus 

/ = 5i/' — /2, /' = 52/2 - /a, fi = gs/s — A . . . . 

The latter equations, in which each remainder is exhibited as the nega- 
tive of a polynomial /», yield a modified process, just as effective as the 
former process, for finding the greatest conmion divisor G of /(x) and/'(x) 
if it exists. 

Suppoee that — /* is the first constant remainder. If /i == 0, then /a = G, since 
ft divides fi and hence also /' and / (by using our equations in reverse order) ; while 
conversely, any conunon divisor of / and /' divides /2 and hence also /a. 

But if /i is a constant ?^ 0, / and /' have no conmion divisor involving x. This 
case arises if and only if fix) = has no multiple root (Ch. I, § 7), and is the only 
case considered in §§ 4-6. 

Before stating Sturm's theorem in general, we shall state it for a numerical 
case and illustrate its use. 

Example, /(x) = x' + 4x2 - 7. Then/' = 3x2 + 8x, 

/=(lx + i) /'-/«, /2-¥x + 7, 

/' = (Hx + ,^2W2-/3, fz = mi 

For X = 1, the signs of /, /', /2, /a are h + +, -showing a single variation of 

consecutive signs. For x = 2, the signs are + + + +, showing no variation of 
signs. Sturm's theorem states that there is a single real root between 1 and 2. 

For X = —00, the signs are 1 h, showing 3 variations of signs. The 

theorem states that there are 3 — 1 = 2 real roots between — 00 and 1. Similarly, 



z 


Signs 


Variations 


- 1 

-2 
-3 
-4 


1 ++ 1 
+ + 1 1 

+ + + + 


1 

2 
2 
3 



Hence there is a single real root between —2 and —1, and a single one between 
—4 and —3. Each real root has now been isolated since we have found two num- 
bers such that a single real root lies between these two numbers or equals one of 
them. 
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4. Sturm's Theorem. Let f(x) = 6e an equation with real coefficients 
and vnthout multiple roots. Modify the usual process for seeking the greain 
est common divisor of f(x) and its first derivaiive * fi(x) by exhibiting each 
remainder as the negative of a polynomial fi: 



(1) f = Qlfl- fh /l = 52/2 - /s, /2 = qzfz - /l, . . . , fn-2 = Qn-lfn-l - / 



ny 



where ** fnis a constant 9^ 0. If a and b are real numbers, a <b, neither 
a root of f(x) = 0, the number of real roots of f{x) — between a and b equals 
the excess of the number of varialions of signs of 

(2) f{x), /i(x), /2(x), . . . , /.-i(x), /n 

for X = a over the number of variations of signs for x = b. Terms which 
vanish are to be dropped oui before counting the variations of signs. 

For brevity, let Vx denote the number of variations of signs of the num- 
bers (2) when x is a particular real number not a root of f{x) = 0. 

^ First, if Xi and Xg are real numbers such that no one of the continuous 
functions (2) vanishes for a value of x between Xi and X2 or for x = Xi or 
X = X2, the values of any one of these functions for x = Xi and x = Xi 
are both positive or both negative (Ch. I, § 12), and therefore Vx^ = Fx,. 

^L Second, let p be a root of /t(x) = 0, where I ^ i < n. Then 

(3) fi-i{x) = qifi{x) - /,+i(x) 

and the equations (1) following this one show that /i-i(x) and /i(x) have 
no common divisor involving x (since it would divide the constant /«). 
By hypothesis, /»(x) has the factor x — p. Hence /»-i(x) does not have 
this factor x — p. Thus, by (3), 

/i-i(p) = -/i+i(p) 9^ 0. 
Hence, if p is a sufficiently small positive number, the values of 

/i-i(x), /»(x), fi^iix) 

for X = p — p show just one variation of signs, since the first and third 
values are of opposite signs, and f or x = p + p show just one variation of 

* The notation /i instead of the usual /', and similarly /o instead of /, is used to reg- 
ularize the notation of all the /'s, and enables us to write any one of the equations (1) 
in the single notation (3). 

** If the division process did not yield ultimately a constant remainder 7^ 0, / and/i 
would have a common factor involving x, and hence f(x) =■ a multiple root. 
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signs, and therefore show no change in the number of variations of sign 
for the two values of x. 

It follows from the first and second cases that Va = V^ ii a and P are 
real numbers for neither of which any one of the functions (2) vanishes and 
such that no root of f(x) = lies between a and p. 
-fit Third, let r be a root of f(x) = 0. By Taylor's Theorem (8) of Ch. I, 



/(r-p) = -pf(r) + ip2r(r)- . . . , 
/(r + p)= pr{r) + hpT(r)+ .... 

If p is a sufficiently small positive number, each of these polynomials in p 
has the same sign as its first term. For, after removing the factor p, 
we obtain a quotient of the form ao + s, where s = aip + OaP^ + . . . 
is numerically less than ao forall values of p sufficiently small (Ch. I, 
end of § 11). Hence if /'(O is positive, /(r — p) is negative and/(r + p) 
positive, so that the terms /(x), fi{x) = f\x) have the signs — \- for 
z = r — p and the signs + + for x = r + p. If /'(r) is negative, these 

signs are H and respectively. In each case, /(x), /i(x) show one 

more variation of signs for x = r — p than for x = r + p. Evidently p 
may be chosen so small that no one of the functions /i(x), . . . , /n vanishes 
for either x = r — p or x = r + p, and such that /i(x) does not vanish 
for a value of x between r tt Z^e^^^d rrf^^^g^^ that /(x) = has the single 
real root r between these limitsXlf'l)? Hence by the first and second* 
cases, /i, ...,/„ show the same niunber of variations of signs for x = r — p 
and X = r + p. Thus, for the entire series of functions (2), we have 

(4) Vr-p - 7.+P = 1.^ 

The real roots of /(x) = within the main interval from a to 6 (i.e., the 
aggregate of numbers between a and b) separate it into intervals. By 
the earlier result, Vx has the same value for all numbers in the same 
interval. By the present result (4), the value of Vx in any interval ex- 

* The argument in the second case when applied f or i = 1 requires the use of 
/o = / and hence does not indicate the variations in a series lacking /. To avoid the 
necessity of treating this case i = 1, we restricted p further than done at the outset so 
that /i(j;) shall not vanish between r ^ p and r -f- p. This necessary step in the proof is 
usually overlooked. Moreover, we have not adopted the usual argument based upon 
the continuous change of x from a to 6, in view of the ambiguity of Vg when x is a root 
of f(x) = 0, etc. 
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ceeds the value for the next interval by unity. Hence Va exceeds Vb by 
the number of real roots between a and b. 

COROLLABY. If a < 6, Fa - Vt. 

EXERCISES 

Isolate by Sturm's theorem the real roots of 

1. x» + 2x + 20 = 0. 2. x» + a;-3 = 0. 

5. Simplifications of Sturm's Functions. In order to avoid fractions, 
we may first multiply f(z) by a positive constant before dividing it by 
/i(x), and similarly multiply /i by a positive constant before dividing it by 
/j, etc. Moreover, we may remove from any /» any factor fc< which is 
either a positive constant or a polynomial in x positive f or * a ^ x ^ &, 
before we use that fi as the next divisor. 

To prove that Sturm's theorem remains true when these modified 
functions /, f i, . . . , Fm are employed in place of functions (2), consider 
the equations replacing (1) : 

/i = kiFiy cj = qxFi - fcjFj, CjFi = qtFt - fcjFa, 

in which C2, Cs, . . . are positive constants and Fm is a constant 3^ 0. A 
common divisor (involving z) of F,_i and Fi would divide F,-f, . . . , 
f 2, Fi, /, /i, whereas f{x) = has no multiple roots. Hence if p is a root 
of Fi{x) = 0, then f »-i(p) 9^ and 

c,H-iFi_i(p) = -fci+i(p) F,+i(p), c+i > 0, fc.+iCp) > 0. 

Thus F»_i and Fi+i have opposite signs for x = p. We proceed as in § 4. 

Example 1. If fix) = x* + 6 x — 10, /i = 3 (x* + 2) is always positive. 
Hence we may employ / and Fi = 1. For x = — 00, there is one variation of 
signs; for x = +», no variation. • Hence there is a single real root; it lies between 
1 and 2. 

Example 2. If /(x) = 2 x* — 13 x* — 10 x — 19, we may take 

/i = 4x»-13x-5. 
Then 

2/ = xfi -/,, /2 = 13x2 + 15x + 38 = 13(3. + jj)2 +411. 

* Usually we would require that ki be positive for all values of x, since we usually 
wish to employ the limits — ao and +^* 
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Since /j is always positive, we need go no further (we may take ^2 = 1). For 

re = — 00, the signs are H h; f or x = +oo, + + +. Hence there are two 

real roots. The signs f or x = are h. Hence one real root is positive and 

the other negative. 

EXERCISES 

Isolate by Sturm's theorem the real roots of 

"1. x' + 3x2-2x-5 = 0. ^2. x<+12x2 + 5x-9 = 0. 

-3. x'-Tx-? = 0. ^4. 3x*-6x2 + 8x-3 = [stopwith/J. 

5. x« + 6x* - 30x* - 12x - 9 = [stop with/2]. 
^6. x*-8x» + 25x»-36x + 8 = 0. 

-7. For/=x3 + px + g(p5^0), /i = 3x2 + p, /2=-2px-3g, 

4pyi = (-6px + 95)/2-/3, /8= ~4p3-27g2, 

so that /i is the discriminant A (Ch. Ill, § 3). Let [p] denote the sign of p. Then 
the signs of /, /i, fz, ft are 

- + +bl [A] forx=-oo, 

+ + -bl [A] forx=+oo. 

For A negative there is a single real root. For A positive and therefore p negative, 
there are 3 distinct real roots. For A = 0, /2 is a divisor of /i and /, so that 
X = —3 5/(2 p) is a double root. 

8. If one of Sturm's functions has p imaginary roots, the initial equation has at 
least p unaginary roots. (Darboux.) 

6. Sturm's Functions for a Quartic Equation. For the reduced quar- 
tic equation f{z) = 0, 

/ = z* + qz^ + rz + 8, 

(5) /i = 43» + 2g3 + r, 

/2 = -2^z2- 3r2-4«. 

Let g ?^ and divide g^/i by /2. The negative of the remainder is 

(6) /3 = Lz- 12rs-rg2, L = 8gs - 2^3 - Or^. 

Let L 7^ 0. Then /j is a constant which is zero if and only if / = has 
multiple roots, i.e., if its discriminant A is zero. We therefore desire /i 
expressed as a multiple of A. By Ch. IV, § 4, 

(7) A = -4P»-27Q^ P=-48-^, Q = f g« - r^ - ^g*. 
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We may employ P and Q to eliminate 

(8) 4s=-P-^, r2= -Q-f(?P-^g». 

We divide L^i by 

(9) /8 = L2 + 3rP, L^9Q + 4tqP. 
The negative of the remainder is 

(10) 18 J^qP^ - 9 r^LP + 4 sL^= g^A. 

The left member is easily reduced to g^A. Inserting the values (8) and 
replacing U by L(9 Q + 4 gP), we get 

Replacing L by its value (9), we get g^A. Hence we may take 

(11) fi = A. 

Hence if gLA 5^ 0, we may take (5), (9), (11) as Sturm's functions. 
Denote the sign of q by [q]. The signs of Sturm's functions are 

+ - - [g] - [L] [A] for X = - 00, 
+ + -Iq] m [A] for X = + 00. 

First, let A > 0. If g is negative and L is positive, there are four real 
roots. In each of the remaining three cases for q and L, there are two 
variations of signs in either of the two series and hence no real root. 

Next, let A < 0. In each of the three cases in which q and L are not 
both positive, there are three variations of signs in the first series and one 
variation in the second, and hence just two real roots. If q and L are 
both positive, the number of variations is 1 in the first series and 3 in the 
second, so that this case is excluded by the Corollary to Sturm's Theorem. 
To give a direct proof, note that by the value of L in (6), 4 « > g*, and 
that P is negative by (7), so that each term of (10) is ^ 0, whence A > 0. 

Hence, if qL\ j^ 0, there are four distinct real roots if and only if A 
and L are positive, and q negative; two distinct real and two imaginary 
roots if and only if A is negative. See Ex. 5 below. 

EXERCISES 

1. If ^A ?^ 0, L = 0, then fz = SrP is not zero and its sign is immaterial in 
determining the number of real roots: two if 5 < 0, none if g > 0. By (10), 
q lias the same sign as A. 
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"-2. If rA ?^ 0, g = 0, obtain — /j by substituting 2 = — 4 s/(3 r) in /i. Show 
that we may take /a = rA and that there are just two real roots if A < 0, no real 
root if A > 0. 

^3. If A ?£ 0, 5 = r = 0, there are just two real roots if A < 0, no real root if 
A > 0. Since A = 256 s', check by solving z* -f s = 0. 

^4. If A ?^ 0, ^L = 0, there are just two real roots if A < 0, no real root if 
A > 0. [Combine the results in Exs. 1-3.] 

5. If A < 0, there are just two real (distinct) roots; if A > 0, ^ < 0, L > 0, 
four distinct real roots; if A > and either ^ = or L = 0, no real root. [Com- 
bine the theorem in the text with that in Ex. 4.] 

6. Apply the criterion in Ex. 5 to Exs. 2, 4, 6, p. 99. 

7. Apply to Exs. 1-3, p. 39, and Exs. 1^, p. 43. 

8. Show that the criterion of Ex. 5 is equivalent to the theorem in Ch. IV, § 7. 
If A > 0, L > 0, 5 < 0, then 4 s - 5* < by (6). Conversely, if A > 0, 5 < 0, 
48- q^ < 0, then L > 0. For, if L = 0, 9 Q = -4 5P < 0, since P < by the 
value (7) of A. Thus 81 Q^ = 16 q^P^, A = 5, where 

5= -4P»- Vg'^ = 4P2(-P- 1^2) =4P2(4s_g2) <o, 

— P having been replaced by its value in (7). Thus A < 0, contrary to hypothesis. 
The two criteria for four real roots are therefore equivalent. The criterion for 
2 distinct real and 2 imaginary roots is A < in each theorem. By formal logic 
the criteria for no real root must be equivalent. 

9. If a J fi, y are the roots of a cubic equation f(x) = 0, Sturm's functions* 
ft fh hy h equal products of positive constants by 

(x-a)(x-/3)(x-T), S(a:-/3)(X-T), S(a-mx-T), (« - m« - t)H/3 - t)^ 

Why is it sufl&cient to prove this for a reduced cubic equation? 

Take / as in Ex. 7, p. 99. Proof is needed only for the third function. In it 
the coefficient of x equals 2 So^ — 2 2a/3 = — 6 p, while the constant is 

— Sa^7 + 6 afiy = —3 ^ — 6 g, 

by Ex. 1, p. 64. Thus the third function equals 3/2. 

10. Sturm's functions for any equation with the n roots a, /3, . . . , x, w equal 
products of positive constants by 

(x — a) ... (x — «), S(x — /3) . . . (x — «), S(a — /3)2(x — y) . . . (z — <a), 
S(a - /3)2(a - y)\^ - yy{x - 5) ... (x -«),... , (a - /3)2 . . . (^ - 0,)^. 

Verify this for n = 4, using § 6. A convenient reference to a proof for any n is 
Salmon's Modern Higher Atgebra, pp. 49-53. 

11. There arc as many pairs p of imaginary roots as there are variations of 
signs in the leading coefficients of Sturm's functions, i.e., p = F+oo- Hints: 
Let a, 6, c be the leading coefficients of three consecutive Sturm functions. If 
a and c have opposite signs, the three functions show a single variation of signs for 

* In Exs. 9-12, it is assumed that there are n -f 1 Sturm's functions for the equation 
of degree n. 
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X = — Qo and for x = + oo ; if they have like signs, the numbers of variations 
are 0, 2 or 2, 0. 

Hence V-ao + V+co = n, the degree of the equation. 

Subtract F-oo — ^+« = ^i the number of real roots. 

Thus 2 y+oo = n - r = 2 p. 

12. By Exs. 10, 11, the number of pairs of imaginary roots is the number oi 
variations of signs in the series 

1, n, S(a - fi)\ 2(a - met - yYifi - T)^ . . . , 

provided no one of these sums is zero. 

T.f Sturm's Theorem for the Case of Multiple Roots. Let*/n(x) be 
the greatest common divisor of f{x) and /i = /'(x). We have equations 
(1) in which fn is now not a constant. The difference Va — Vbisihe num-' 
her of real roots between a and 6, each multiple root being counted only once. 

If p is a root of /»(x) = 0, but not a multiple root of /(x) = 0, then 
/»-i(p) 9^ 0. For, if it were zero, x — p would by (1) be a conmion factor 
of / and /i. We may now proceed as in the second case in § 4. 

The third case requires a modified proof only when r is a multiple root. 
Let r be a root of multiplicity m, m ^ 2. Then/(r), /'(r), . . . , /^■^^>(r) 
are zero and, by Taylor's Theorem, 

^'('■+p> = i.2.r(^-i) ^"'^<'-)+ • •■• • 

These have like signs if p is a positive number so small that the signs of 
the polynomials are those of their first terms. Similarly, /(r — p) and 
/'(r — p) have opposite signs. Hence/ and /i show one more variation of 
signs f or X = r — p than for x = r + p. Now (jr — r)"*"* is a factor of 
/and/iandhence, by (l),of/2, . . . , fn- Let their quotients by this factor 
he it>f it>if . . . , <f>n- Then equations (1) hold after the fs are replaced by 
the <t>'s. Taking p so small that <f>i{x) = has no root between r — p and 
r + p, we see by the first and second cases in § 4 that <<>i, . . . , ^ show 
the same number of variations of signs for x = r.— pasforx = r + p. 
The same is true for /i, . . . , /n since the products of <<>i, . . . , 0n by 

• 

(x — r)"*~* have for a given x the same signs as <<>i . . . , 0» or the same 
signs as — 01, . . . , — <<>n. But the latter series evidently shows the 
same number of variations of signs as <^i, . . . , 0n* Hence (4) is proved 
and consequently the present theorem. 

* The degree of /(x) is not n, nor was it necessarily n in § 4. 
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EXERCISES 

Lt For/ = X* - 8x« + 16,/i = a:' - 4a;,/2 = x* - 4,/i = a/j. Hence n = 2. 
Then F_oo = 2, F* = 0, and there are only two real roots, each a double root. 
2.t / = (x - l)\x - 2). 3.t (x - mx + 2)3. 4.t x^-x^-2x + 2. 

8.t Budan's Theorem. Let a and b be real numbers j a < b, neither 
a root of f{x) = 0, an eqriaiion of degree n with real coefficients. Let Va 
denote the number of variaiions of signs of 

(12) m, fix), fix), .... f^x) 

for X = a, after vanishing terms have been deleted. Then Va — ^6 is either 
the number of real roots of f{x) = between a and b or exceeds the number of 
those roots by an even integer. A root of multiplicity m is here counted as m 
roots. 

In case Fo — Ffe is or 1, it is the exact number of real roots between a and 6. 
In other cases, it is merely an upper Umit to the number of those roots. While 
therefore the present method is not certain to lead to the isolation of the real 
roots, it is simpler to apply than Sturm^s method. Indeed, for an equation of 
degree 6 or 7 with simple coefl&cients, Sturm's functions may introduce numbers of 
50 or more figures. 

.The proof is quite simple if no term of the series (12) vanishes for 
a: = a or for x = b and if no two consecutive terms vanish for the same 
value of X between a and b. Indeed, if no one of the terms vanishes for 
Xi ^ X ^ X2, then Fx, = F,,, since any term has the same sign for x = Xi 
as for X = X2. Next, let r be a root of f^^{x) = 0, a < r < 6. By hy- 
pothesis, the first derivative /(*"^*>(x) of f^^{x) is not zero for x = r. As in 
the third step (now actually the case i=0) in § 4, f^^{x) and /(*"*■*) (x) show 
one more variation of signs for x = r — p than for x = r + p, where p is 
a suflSciently small positive number. If i > 1, /(O is preceded by a term 
/(»-!) in (12). By hypothesis, /(*~*)(x) ?^ for x = r and hence has the 
same sign for x = r — p and x = r + p when p is sufficiently small. 
For these values of x, /(*)(x) has opposite signs. Hence /(*-^) and /(*) 
show one more or one less variation of signs for x = r — p than for 
X = r + Pj so that /(*"*^ /^*^, f^^^^ show two more variations or the 
same number of variations of signs. 

Next, let no term of the series (12) vanish f or x = a or for x = 6, but 
let several successive terms 

(13) f^Hx), f^'^Hx), . . . , fi»-'-Hx) 
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all vanish for a value r of x between a and b, while /(*"*"'^ (r) is not zero, 
say positive.* Let /i be the interval between r — p and r, and 1 2 the 
interval between r and r + p. Let the positive number p be so small 
that no one of the functions (13) or /(*"^')(x) is zero in these intervals, so 
that the last function remains positive. Hence /(*+'"*) (^) increases with 
X (since its derivative is positive) and is therefore negative in /i and positive 
in /2. Thus f^'^^'~^^(x) decreases in /i and increases in 1 2 and hence is 
positive in each interv^al. In this manner we may verify the signs in the 
following table: 



/i 
h 



/(»•) /(»■+!) /(»+2) ^ ^ ^ y(i+;-3) y(»>;-2) ^(t+y-l) y(»+;-) 

(-1)' (-1)'-^ (-l)-2 ..." + - + 

+ + + . . . + + + + 



Hence these functions show j variations of signs in /i and none in I2. 

If i > 0, the first term of (13) is preceded by a function /(*"^>(x) which 
is not zero for x = r, and hence not zero in /i or h if p is sufficiently small. 

If j is even, the signs of /('~^^ and/('^ are + + or h in both /i and hy 

showing no loss in the number of variations of signs. If j is odd, their 

signs are 

/i + - 

or 

h ++ - + 

so that there is a gain or loss of a single variation of signs. Hence 

/(.-.), /(.•), /(.•+!), . . . , /(.•+,-> 

show a loss of j variations of signs if j is even, and a loss of j ifc 1 if j is 
odd, and hence always a loss of an even number ^ of variations of 
signs. 

If i = 0, /^*> = / has r as a j-fold root and the functions in the table show 
j more variations of signs for x = r — p than for j = r + p. 

Thus, when no one of the functions (12) vanishes for x = a or for a: = 6, 
the theorem follows as at the end of § 4 (with unity replaced by the mul- 
tiplicity of a root). 

Finally, let one of the functions (12), other than /(x) itself, vanish for 
X = a or for X = 6. If 5 is a sufficiently small positive numl^er, all of the 
N roots of /(x) = l>etween a and b lie between a + 5 and 6 — 6, and 

* If negative, all sigas in the table below are to be changed; but the conclusion holds. 
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for the latter values no one of the functions (12) is zero. By the above 
proof, 

F«-7a+a = 2i, 76-3-76 = 25, 

where t, j, « are integers ^ 0. Hence Va— 76 = iV + 2 (^ + j + s). 
Example. For/ = x^ — Ix — 7, 

There is one variation of signs for x = 3, but none for x = 4, so that just one real 
root lies between 3 and 4. For 



x\ f r r 



-2 
-1 



— 1 H-5 — 12 -f6 3 variations 

— 1 — 4 — 6 +6 1 variation. 



Thus there are two real roots or no real root between —2 and —1. The former is 
the case. The reader should isolate the two roots by finding an intermediate value 
of X for which the series shows two variations of signs. 

EXERCISES 

Isolate by Budan's theorem the real roots of 

l.t a:»-x*-2x + l = 0. 2.t x' + 3x2 - 2x - 5 = 0. 

3.t If /(a) 7^ 0, Va equals the number of real roots > a or exceeds that number 
by an even integer. 

4.t There is no root greater than a number making each of the functions (12) 
positive, if the leading coefficient of /(x) is positive. (Newton.) 

5.t Divide /(x) = x** + aix"^"^ -f • • • by x — «; then 

/(x) ^ix-a)lx^-' + x^'^gM+ . . . + ^„-i(a) }+/(«), 

where gi(a) = a + ai, gf2(a) = a* + oia + oj, . . . . If a is chosen so that 
gMf . . . , fl^n-i(a), /(a) are all positive, no positive root of /(x) = exceeds a. 
(Laguerre.) 

9. Descartes' Rule of Signs. The number of positive roots of an 
equation with real coefficients either equals the number V of varialions of 
signs in the series of coefficients or is less than 7 by an even integer. A root 
of multiplicity m is here counted as m roots. 

For example, x® — 3x* + x + l = has either two or no positive 
roots, the exact number not being found. But — Sx' + x + l = has 
exactly one positive root. 
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Consider any equation with real coefficients 

fix) = OoX'* + aiX*-^ + • • • + On-lX + On = 0, 

with a„ 3^ 0. For x == the functions (12) have the same signs as 

so that Vo = V. For x = +oo, the functions have the same sign (that 
of Oo). Thus Fo — V'oo = V is either the number of positive roots or 
exceeds that number by an even integer. Next, the theorem holds if 
/(O) = 0, as sho\\Ti by removing the factors x. 

Corollary. The number of negative roots of f{x) = is either the 
number of variations of signs in the coefficients of /(— x) or is less than 
that number by an even integer. 

Thus x* — Sx^ + x + l = has either two or no negative roots, since 
x« - 3 x2 - X + 1 = has two or no positive roots. 

EXERCISES 

1. x* — 3x + 2 = has one negative root and two equal positive roots. 

2. x* + a*x + 6* = has two imaginary roots if 6 ^ 0. 

3. For n even, x** — 1 = has only two real roots 

4. For n odd, x** — 1 = has only one real root. 

5. For n even, x** + 1 = has no real root; for n odd, only one. 

6. x^ + 12 x^ + 5 X — 9 = has just two imaginary roots. 

7. X* + aV H-6^ — c2 = 0(c?^0) has just two imaginary roots. 

8. To find an upper limit to the number of real roots of /(x) = between a and 6, aet 



a + by 

X = 



multiply by (1 + y)", and apply Descartes' Rule to the resulting equation in y. 

10. t Fourier's Method. If Budan's Theorem gives a loss of two or 
more variations of signs in passing from a to a larger value 6, and hence 
leaves in doubt the number of real roots between a and 6, we may employ 
a supplementary discussion. 

First, let/, /', /" show two variations of signs at a and no variation at 6, 
while the series beginning with /" shows no loss in variations (as in the 
Example in § 8). Then /" is of constant sign between a and b, and the 
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graph of 1/ = f{x) has a (single) maximiim or minimum pouit between 
a and b] according as f" is negative or positive. If the sum 

m m 

fib) /'(a) 

of the subtangents at the points with the abscissas a and 6 is > 6 — a, 
the tangents cross before meeting the x-axis and the graph does not inter- 
sect the X-axis between a and 6, so that there are two imaginary roots 
in view of Budan's Theorem and 

(14) n =7-00 -^co = (^-00 - Va) + (Va " Vt) +[(¥,- VJ. 

In the contrary case, we examine the value half way between a and 6, 
etc. Clearly the case of imaginary roots will disclose itself after a very 
few such steps. 

Next, in the general case, we shall find, after a suitable subdivision of 
the interval, three consecutive functions 

showing two variations of signs at a' and ^o variation at b', while the 
later terms of the series show no loss in variations of signs. We may 
therefore decide as in the first case whether there are two real roots of 
Z^') = in the interval [a', 6'] or not, and in the latter alternative conclude 
that/ = has two imaginary roots.* 

Example. Let /(x) = x« - 5a:* - 16x« + 12x* - 9x - 6. Then 

fix) = 5a:*-20x'-48x« + 24x-9, 
i/"(x) = 5x» - 15x2 - 24x + 6, 
A/'"(x) = 5x2-10x-8, 
Thr"(x) = x-1, /W(x) = 120. 

There is just one real root in each of the intervals (— 3, -•2), ( — 1, 0), (7, 8). The 
interval (0, 1) is in doubt, the signs being 

- - + - - + for X = 0, 

— — — — — + for X = 1, 

where is read — . The j of the text is here 1. Now 

rH) nO) ^ -48 Q _ 3 . 3 

/"(I) /"(O) 4 (-28) 4(6) 7 "^ 8 ' 

* For further details, see Serret, Alghbre Sup&rieure, ed. 4, 1, pp. 305-318. 
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80 that we must subdivide the interval. For x » }, the signs are the same as 
for X = 1. Thus the loss in variations of signs occurs in the interval (0, i). Now 

ru) r(o) 4(-9i)'^8^2' 

Hence there are two imaginary roots. 

EXERCISES 

l.t X* — 3a:* + 2x* — 8x' + 3x — 25 = has 4 imaginary roots. 
2.t x* + x* — X* — x* + x* — x + l = has 6 imaginary roots. 



CHAPTER X 

Solution of Numerical Equations 

1. Newton's Method. To find the root between 2 and 3 of 

x»- 2x- 5 = 0, 

Newton * replaced x by 2 + p and obtained 

p^ + 6p2 + i0p- 1 = 0. 

Since p is a decimal, he neglected** the first two terms and set 10 p— 1 = 0, 
so that p = 0.1, approximately. Replacing p by 0.1 + g in the preceding 
cubic equation, he obtained 

5» + 6.3g2 + 11.23g + 0.061 = 0. 

Dividing —0.061 by 11.23, he obtained —0.0054 as the approximate 
value of q. Neglecting ^ and replacing q by —0.0054 + r, he obtained 

6.3 r2 + 11.16196 r + 0.000541708 = 0. 

Dropping 6.3 r^, he found r and hence 

X = 2 + 0.1 - 0.0054 - 0.00004853 = 2.09455147. 

This value is in fact correct to the seventh decimal place. But the 
method will not often lead as quickly to so accurate a value of the root. 

The method is usually presented in the following form. Given that a 
is an approximate value of a real root of f(x) = 0, we can usually find a 
nearer approximation o + A to the root by neglecting the powers A^ A', . . . 
of the small number h in Taylor's formula 

Ka + h)=m+r{a)h+r{a)^+ . . . 
and hence by taking 

We then repeat the process with a + A in place of the former a. 

♦ Isaaci Newtoni, Opuscyla^ I, 1794, p. 10, p. 37 [found before 1676]. 
** At this early stage of the work it is usually safer to retain also the term in p* and 
thus find p approximately by solving a quadratic equation. 
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Thus in Newton's example, we have, for a = 2, 

^^ = 7^=^' a^ = a + A = 2.1, 

-/(2.1 )^~ 0.061^ 
^ /'(2.1) 11.23 ^'^^ • • • • 

2. Graphical Discussion of Newton's Method. Using rectangular 
coordinates, consider the graph of y = f(x) and the point P on it with the 
abscissa OQ = a (Fig. 22). Let the tangent at P meet the x-axis at T 



a 




Q T Tj 



Fig. 22 




Fig. 23 



and let the graph meet the x-axis at S. Take h = QT, the subtangent. 
Then 

QP =/(a), f\a) = tBuXTP = -/(a)/A, 

'^ " T(a) • 

In the fictitious graph in Fig. 22, OT = a + A is a better approximation 
to the root OS than OQ = a. The next step (indicated by dotted lines) 
gives a still better approximation OTi, * 

If, however, we had begun with the abscissa a of a point Pi near a bend 
point, the subtangent would Ix) ver>' large and the method would probably 
fail to give a better approximation. Failure is certain if we use a point 
Pi such that a single lx*nd point lies between it and S, 

We are concerned with the approximation to a root previously isolated 
as the only real root between two given numbers a and 0. These should 
be chosen so nearly equal that/'(x) = has no real root between a and /3, 
and hence /(x) = no bend point between a and 0. Further, iif"{x) =■ 
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has a root between our limits, our graph will have an inflexion point 
with an abscissa between a and /3, and the method likely will fail (Fig. 23), 
Let, therefore, neither f'(x) nor /"(^) vanish between a and /3. Since 
/" preserves its sign in the interval from a to P, while / changes in sign, 
/" and / will have the same sign for one end point. According as the 
abscissa of this point is a or /3, we take o = a or o = /3 f or the first step of 
Newton's process. In fact, the tangent at one of the end points meets 
the X-axis at a point T with an abscissa within the interval from a to ff. 
If /'(x) is positive in the interval, we have Fig. 24 or Fig. 25; iif is nega- 
tive. Fig. 26 or Fig. 22. 






Fig. 24 



Fig. 25 



^^^ 



Fig. 26 




In Newton's example, the graph between the points with the abscissas a = 2 
and /8 = 3 is of the type in Fig. 24, but more nearly like a vertical straight line. 
In view of this feature of the graph, we may safely take a = a, as did Newton, 
although our general procedure would be to take a = /8. The next step, however, 
accords with our present process; we have a = 2, /3 = 2.1 in Fig. 24 and hence 
we now take a ~ /9, getting 

0.061 ^^,, 

11:23 = «•««* 

as the subtangent, and hence 2.1 — 0.0054 as the approximate root. 

If we have secured (as in Fig. 24 or Fig. 26) a better upper limit to the 
root than /3, we may take the abscissa c of the intersection of the chord 
AB with the x-axis as a better lower limit than a. By similar triangles, 

-/(a) :c-a=/03) : /3 - c. 



(1) 



^ a/09) - pfja) 



This method of finding the value of c intermediate to a and /3 is called the 
method of interpolation (regula falsi). 
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In Newton's example, a = 2, /8 = 2.1, 

/(a) = -1, /03) = 0.061, c = 2.0942. 

The advantage of having c at each step is that we know a close limit of 
the error made in the approximation to the root. 

We may combine the various possible cases discussed into one: 

If /(^) = has a single real root and f'{x) = 0, /"(^) = fiave no real root 
between a and /3, and if we designate by /3 that one of the numbers a and fi 
for which f(fi) and f'ifi) have the same sign, then the root lies in the rujrrower 
interval from ctofi— f{fi)/f(fi), where c is given by (1). 

It is possible to prove* this theorem algebraically and to show that by 
repeated appUcations of it we can obtain two limits a', /3' between which 
the root lies, such that a' — fi' is numerically less than any assigned 
positive number. Hence the root can be found in this manner to any 
desired accuracy. 

Example, /(x) = x» - 2 x^ - 2, a = 21, /3 = 2i. Then 

Neither of the roots 0, 4/3 oif{x) = lies between a and /8, so that/(x) =« has 
a single real root between these limits (Ch. IX, § 1). Nor is the root | of /"(x) — 
within these limits. The conditions of the theorem are therefore satined. For 
a < X < /3, the graph is of the type in Fig. 24. We find that 

c = Ml = 2.^^ fi'^fi -^j = 2.3714, 

/3'-ji^ = 2.3597. 

For X = 2.3593, /(x) = -0.00003. We therefore have the root to four dedmal 
places. For a = 2.3593, 

f{a) = 7.2620, a-^ = 2.3593041, 

which is the value of the root correct to 7 decimal places. For, if we change the 
final digit from 1 to 2, the result is greater than the root in view of our work, while 
if we change it to 0, /(x) is negative. 

♦ Weber's Algebra, 2d ed., I, pp. 380-382; Kleines Lehrbuch der Algebra, 1912, p. 168. 
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EXERCISES 

(Preserve the numerical work for later use.) 

1. Mnd the root between 1 and 2ofa:^ + 4a;2— 7 = correct to 7 decimal 
places. 

2. Find the root between —1 and —2 to 5 decimal places. 

3. Find a root ofx' + 2x + 20 = 0to5 decimal places. 

3. Systematic Computation by Newton's Method. Set 



fi = i/"> /s = ?r~Q/'" = i /«'> /* = 



2-3 



'^^^^ ^y.'^?^^^?'y?T;Y|f'r 



2.3.4 



J ^4j8|«»»» 



/ - i'Af' i ■, 






/ (x + A) = /(a;) + hf(x) + h'Mx) + }?Mx) + ii*Mx) + • • • . 

fix + h)= fix) + 2 hMx) + 3 K'Mx) + 4 Ay4(x) + • • • . 

/iCa; + A) = /!i(a;) + 3 hfzix) + 6 hj,{x) + ■ ■ • . 

Mx + h)= Mx) + 4 hMx) + . . • . 

The second formula may also be derived from the first by differentiation 
with respect to h (or if we prefer, with respect to a;), and likewise the 
third from the second, with a subsequent division by 2, etc. 

Theworkof findmg/(a; + /i),/'(x + ^), • . . from/(a;),/'(a;),/2(a;), . . . 
may be arranged as follows for the case n = 3, whence /« = 0: 





f 
+ h{f, + A/,) 

/' + hf, + h% 

+ A(/2 + 2 A/a) 


f 
+ hif + hf, + AV,) 


/. + A/3 


/ + A/' + A% + A»/, 
= fix + A) 


ft + 2 hf» 


f' + 2hfi + 3h% 




= fix + A) 



/2 + 3A/8 =f2(x + h) 

Here we have added hfi to /2. This sum is multiplied by h and the 
product added to /'. To the resulting sum is added h times the second 
sum /2 + 2 ft/a in the second colunm; etc. 

Example 1. f{x) = a^ -2x^-2, Then 

/'(a;) = 3 x« - 4 a;, /2(x)=3x-2, /aW = 1. 

Their values for x = /3 = 2} are given in the first line below. Since* 
h = -///' = -0.129, the work is as follows: 

* Ordinarily we would use at this step the value h «— .13, which is sufficiently 
exact and simplifies the numerical work. 
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4 
.1 



f 
I 



1 6.6 


8.75 


1.125 


-0.129 


-0.69286 


-1.03937 


5.371 


8.05714 


0.08563 


-0.129 


-0.67622 




5.242 


7.38092 




-0.129 







1 5.113 

The numbers at the bottom are the values of 

/3, fiifi% /V), /W for /?' = /8 + ;i = 2.371. 
Example 2. Netto treats in his Algekra the equation 

f{x) =a:^ + x»-3x*-a;-4 = 0. 
Then 

/'(x)=4x» + 3x«-6a;-l, /j = 6a;» + 3x - 3, /, = 4x + l, /» - 1. 

Since /(I) = -6, /(2) = 6, there is a root of J{x) = between 1 and 2. By 
Descartes' Rule, f\x) and /aCx) each have a single positive root. Since /'(I) " 0, 
/j(l) = 6, /i(2) = 27, neither has a root between 1 and 2. Since /(2) and /"(2) 
are of like sign, we take /8 = 2. The values of /i, . . . , / f or x = 2 are given 
in the first line below. 



1 9 


27 


31 


6 


-0.2 


- 1.76 


- 5.048 


-6.1904 


8.8 


25.24 


25.952 


0.8096 


-0.2 


- 1.72 


- 4.704 




8.6 


23.52 


21.248 




-0.2 


- 1.68 






8.4 


21.84 




-0.2 









-6 
31 



-0.2 



1 



8.2 
-0.04 - 0.3264 



- 0.860544 -0.81549824 



8.16 21.5136 20.387456 
-0.04 - 0.3248 - 0.847552 



8.12 21.1888 
-0.04 - 0.3232 



-0.8096 
21.248 
-0.04 



-0.00589824 



19.539904 



8.08 
-0.04 



20.8656 



8.04 



0.00589824 
19.539904 



-0.000302- 



f 
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The root is 2 - 0.2 - 0.04 + 0.000302 = 1.760302, in which only the last figure 
is in doubt. Indeed, it can be proved that if the quotient f/f begins with k zeros 
when expressed as a decimal^ the best approximation is obtained by carrying the division 
to 2 k decimal places. 

EXERCISES 

1. Extend the work of Example 1 above. 

2. Apply the present method to Exs. 1, 2, 3, page 113. 

3. Treat in this way Newton's example (§1). 

4. In the four long formulas at the beginning of § 3, any arithmetical coefficient 
equals the sum of the one preceding it and the one above that preceding one, as 
6 = 3 + 3, 4 = 1 + 3. 

4. Homer's Method.'*' To find the root between 2 and 3 of 

x»-2x--5 = 

by the method now to be explained, we shall modify in two respects the 
process used by Newton (§ 1) . While in the latter process we set x = 2 + p 
and found the cube of 2 + p, etc., in order to form the transformed equation 

p^ + 6p2+i0p- 1 = 

for p, we shall now obtain this equation by a different process. Since 
p = X — 2, 

a:' - 2 a; - 5 s (x - 2)' + 6 (x - 2)2 + 10 (x - 2) - 1, 

identically in x. Hence — 1 is the remainder obtained when x* — 2 x — 5 
is divided by x — 2; the quotient Q evidently equals 

(x - 2)2 + 6 (x - 2) + 10. 

Similarly, 10 is the remainder obtained when this Q is divided by x — 2 
and the quotient Qi equals (x — 2) + 6. Another division gives the 
remainder 6. Hence to find the coefficients 6, 10, — 1 of the terms after 
p^ in the new equation in the variable p = x — 2, we have only to divide 
the given function x* — 2x — 5byx — 2, the quotient Q by x — 2, etc., 
and take the remainders —1, 10, 6 in reverse order. However, when the 
work is performed as tabulated below, no reversal of order is needed, 
since the coefficients then appear on the page in their desired order. 

* W. G. Homer, London Pkilosophical TransactionSy 1819. 
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Synthetic Division. We next explain a brief method of performing a 
division by z — 2 and, in general, by x — h. When we divide 

fix) = oox" + aix"-! +•••+«„ 

by X — h, let the constant remainder be r and the quotient be 

q(x) = M""^ + 6ix"~* + • • • + 6n-i. 

Comparing the coefficients of f{x) with those in 
{x — h) q{x) + r 

=6oa:"+(6i--A6o)x»-i+(62-/i6i)x»-2+ • • • +(6n-i--A6n-2)x+r-A6«-i, 
we obtain relations which may be written in the form 

&o=ao, 6i = ai+W>o, 62 = 02+^61, . . . , 6n-i = an-i+A6n-2i r=an+A6n-i. 
The steps in the work of computing the 6's may be tabulated as follows: 

Oo 



hbo 



(h 
Kb, 



• • • 



dn-l 
hbn-2 



an \h 
hbn-i 



bo 61 62 . . . bn-ly T 

In the second space below Oo we write 60 (which equals Oo). Then mul- 
tiply 60 by A and enter the product under ai, add and write the sum 61 
below it, etc. Tliis process was used in Ch. I, § 5, to get the value r 
of /(ft). See also Ch. VI, § 6. 



In our example, the work is as follows: 

10-2 -5 
2 4 4 



^ 



1 


2 


2 


-1 




2 


8 




1 


4 
2 


10 





1 G 

Thus 1, 6, 10, —1 are the coefficients of the equation in p. 

But there is a more essential difference between the methods of Homer 
and Newton than the detail as to the actual work of finding the trans- 
formed equations. Newton used the close approximation 0.1 to the root 
of the equation in p. As this value exceeds the root p and hence would 
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lead to a negative correction at the next step, Homer would have used 
the approximation 0.09 (taking a decimal, with a single significant figure, 
just less than the root). The next steps of Homer's process are as 
follows: 



1 6 
0.09 




10 
0.5481 


-1 0.0 
0.949329 


9 


1 6.09 10.5481 
0.09 0.5562 


-0.050671 


1 6.18 
0.09 


11.1043 




0.05 
11.1 


1 6.27 




=0.004 


0.004 0.025096 


0.044517584 


i«(./i-3M.-fc 


1 6.274 11.129396 
0.004 0.025112 


-0.006153,416 




— r. -L" ^s . yoo i i 


1 6.278 
0.004 


11.154508 


II, < J"f 9 "l 


1 6.282 









Hence x = 2.094+i, where t is between 0.0005 and 0.0006. Thus ^3+6.282 1^ 
is between 0.0000015 and 0.0000023, so that the constant term should be 
reduced by 2 in the sixth decimal place. We now have .,-».> 

11.154508^ = 0.006151+, ^ = 0.0005514+, ^ ' 

with doubt only as to whether the last figure of t should be 4 or 5. 

ElxAMPLE 1. Find the root between 1 and 2, correct to seven decimal places, of 
x^ + 4 x* - 7 = 0. 

See p. 118. The figure in the fourth decimal place is evidently 2. Thus 
X = 1.164 + 2/, 0.0002 < 2/ < 0.0003, y^ + 7A92if + • • • =0, 

0.000000299 < 2/3 + 7.492 1/ < 0.000000675, 
0.003316381 < 13.376688 y < 0.003316757, 
0.00024792 < y < 0.00024795. 

Hence x = 1. 1642479+ , in which all of the figures are correct. But this work may 
be abridged. The sum of the terms in y^ and y^ has its first significant figure in the 
seventh decimal place, as shown by 7.5 (0.0003)*. Hence, returning to the final 
numbers in our transformation scheme above, we carry the division of 0.0033170 
by 13.376688 until we reach a remainder whose sign is in doubt in view of the 
doubt on the seventh decimal place of the dividend. Doubt would here first arise 
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1 4 

1 





5 




-7 11 
5 


1 6 

1 


5 
6 




-2 


1 6 

1 




11 




• 


1 7 
0.1 




0.71 


|0.1 
1.171 


1 7.1 
0.1 


11.71 
0.72 




-0.829 


1 7.2 
0.1 


12.43 






1 7.3 
0.06 


0.4416 


0.06 
0.772296 


1 7.36 
0.06 


12.8716 
0.4452 


-0.056704 


1 7.42 
0.06 


13.3168 
0.029936 




1 7.48 
0.004 


|0.004 
0.053386944 


1 7.484 
0.004 


13.346736 
0.029952 


-0.003317056 


1 7.488 
0.004 


13 .376688 




1 7 .492 





in the case of the figure 9 in the seventh decimal place of the quotient; but this 
doul)t is removed by noting that the correction to be subtracted from the seventh 
decimal place of the di\ddend is a figure between 2 and 7 (as shown by the above 
examination of the terms in if and i/). 

Ex.\MPLE 2. Find the root between —4 and —3, correct to seven decimal 
places, of the equation in Ex. 1. 

Tsing the multipliers -4, +0.6, +0.008, we find that a: = -4 + 0.608 + y 
where 

y* - 6.176 y^ + 7.380992 y - 0.004556288 = 0. 

Thus y just exceeds 0.0006. The sum of the terms in 7/ and ?/» is —0.000002 to 
six decimal places. Carrying the di\nsion of 0.004558 by 7.3S1 until the sign <rf 
the remainder is in doubt, on account of the doubt in the sixth decimal place, we 



|5. « SOLUTION OF NUMERICAL EQUATIONS 119 

get y = 0.0006175, with the slight doubt due to the approximate value of the 
divisor and that of the y^ term. Since the cube of 6.176 is just less than 235.6 
(as shown by logarithms), the sum of the terms in y^ and y^ is —0.000002356 
to nine decunal places. Carrying out the division of 0.004558644 by the exact 
coefficient of y, we get y = 0.0006176, correct to seven decimal places. Hence 
X = -3.3913823. 

EXERCISES 

1. Find to 7 decimtds the root ofx' + 4x* — 7 = between — Ij —2. 

2. Rnd to 7 decimals all the roots ofx*- 7x — 7 = 0. 
Mnd to 5 decimals all the real roots of 

3. a:» + 2x + 20 = 0. 4. x» + 3x2 - 2x - 5 = 0. 

5. x» + x*-2x-l =0. 6. a:* + 4x»- 17.5x2 -18x + 58.5 = 0. 

7. X* - 11727 X + 40385 = 0. (G. H. Darwin.) 

8. Find to 8 decimals the root between 2 and 3 of x*- x — 9 = by making 
only three transformations. 

t/S.f Without the intermediation of the idea of division by x — A, we 
may show directly that the process of § 4 yields the correct transformed 
equation. For simplicity, we take a cubic equation 

fix) sax« + 6x2 + cx + d = 0. 

Our process was as follows: 

a b c d Vi 

ah ah^ + hh oA» + hh^+ ch 



a ah + b ah^ + bh + c 

ah 2 a¥ + bh 



a 2 ah + b 
ah 



aV + bh^ + ch + d 

= f(h) 



Sah^ + 2bh + c--f(h) 



a 3 ah + b = ^/"W 
Hence the transformed equation is 

u"'(h)p' + if"ih)p^ +nh)p+m = 0. 

The terms of the left member, read in reverse order, are those of Taylor's 
formula for the expansion of /(A + p). Hence the above process yields 
the equation obtained from f(x) = by setting x = h + p, 

^ 6. t Numerical Cubic Equations. After finding a real root r ?^ of 

f(x) = 3? + bz^ + cx + d = 0, 
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we may obtain the remaining roots n and r2 from 

n + r2 = — 6 — r, ' rir2 = — = f^ + br + c. 

T 

We have 

(2) (ri - raV s (n + r2)2 - 4rir2 = 6^ - 4c - 2&r - 3r*. 

Thus ri — r2 is either real or a pure imaginary. Making use also of 
T\ + u, we shall have the real or imaginary expressions of ri, r^. As it 
would be laborious to compute the right member of (2), we may make 
use of a device. We have 

(ri-r2)2 = 62-3c-/'(r). 

The value of f{r) for the approximate value of r obtained at any stage 
of Homer's process is the coefficient preceding the last one in the next 
transformed equation (§5). 

Example. Let /(x) = x' + 4 x^ - 7. By Ex. 1, p. 117, 

/'(1.164) = 13.376688. 

If we continue Homer's process, using the multiplier m = 0.000248, and retaining 
only six decimal places, we see that we must twice add 7.492 m = 0.001858 to the 
preceding /' to get 

/'(r) = 13.380404, r = 1.164248. 

But this continuation of Horner's process is unnecessary. Using /'"(x) = 6 and 
the work on p. 118, we have 

/'(x + m) = /'(x) + m/"(x) + 3 m\ 1/"(1.164) = 7.492, 

/'(r) = 13.37 . . . + 2 m (7.492) + 0.0000002 = 13.3804042. 

Hence we get 

(r, - nY = 2.6195958, n - r j = 1.6185165, 
ri + r, = -5.1642479, n = -1.7728657, r, = -3.3913822. 

?!• Numerical Quartic Equations. Let 

f{x) = x* + 6x' + cx2 + dx + e = 

have two distinct real roots r and s. When these are found approximately 
by Homer's process, we get at the same time /'(r), /'(s), approximately. 
Call the remaining roots ri and r2. Then 

Ti + rj = — 6 — r — s, 
r^n ::^ c- {r + s)(ri + ^2) - rs = c + fe(r + s) + r* + rs + «*, 
(rj - r2)2 = 62 - 4 c - 2 6(r + s) - 3 r2 - 2 rs - 3 s*, 
(n - nYQ) + 2 r + 2 s) = -(n - r2)K2n + 2r2 + 6) 
= 6»-46c-8c(r + s) + 6(- 7r2-10rs-7s2)-6r'-10r2s-10r«*-6A 
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To the second member add the product of 10 by 

Hence 

(n - nYib + 2r + 2s) = &«-46c + 8d +/' (r) + f\s). 

From this equation we get ri — r2 and then find ri and r2, approximately. 

EXERCISES t 

1. After finding one of the real roots of the cubic equations in Exs. 2, 3, 4, 5, 8, 
p. 119, find the remaining roots by § 6. 

2. Treat the quartic equations in Exs. 6, 7, p. 119, by § 7. 
Find two and then all of the roots of 

3. x* + 12x + 7 = 0. 4. a:* - 80x8 + 1998x2 -14937X + 5000 = 0. 

X" 8.t Graffe's Method. First, let all of the n roots Xi, . . . , Xn be real 
and distinct numerically. Choose the notation so that Xi exceeds X2 nu- 
merically and X2 exceeds Xs numerically, etc. In 

(3) szr = xr(n-g + |;+-.-), 

each fraction approaches zero as m increases, so that Xi^ is an approxi- 
mate value of 2xi*" if m is sufficiently large. Similarly, 

(4) Sa:x-X2- = Xx-X2-^l+^ + g;;i+ • • • +^^;^+ ' " ). 

so that Xi'"X2*" is an approximate value of Sxi'"X2'^ for m large. Now 
Xi*", . . . , Xn*" are the roots of 

(5) I/" — Sxi*" • 2/""^ + Sxi'^X2'^ • 2/*"^— • • • = 0. 

As illustrated in the examples below, it is quite easy to form this equation 
(5) for values of m which are the successive powers of 2. After obtaining 
the equation in which m is sufficiently large, we divide each coefficient 
by the preceding coefficient and^ obtain approximate values of the nega- 
tives of Xi*^, X2*", .... Indeed, the coefficients are approximately 

1, — Xi*", Xi*"X2'", — Xi*"X2*"Xs'", .... 

Example 1. For x' + x* — 2 x — 1 = 0, we first form the cubic equation 
whose roots are the squares of the roots Xi, xa, xs of the given equation. To this 
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end, we transpoee the terms a^, — 1, of even degree, square, replace x* by y, and get* 

ys- 52,2 + 62/- 1 = 0, 

whose roots are 2/1 = Zi', yt = xj*, 2/3 = xa*. Repeating the operation, we get 
2^-1322 + 262-1 = 0, t;3-117t;» + 650v-l = 0, 

with the roots Zi = yi', . . . , and vi = 21*, .... Hence the roots of the tMJubic 
are the 8th powers of Zi, X2, Xz, By logarithms, the 8th roots of 117, f{J, ^J^ (the 
approximate values of xi®, X2^ X3*) are 1.813, 1.239, 0.4450, which are therefore 
approximate numerical values of xi, X2, xj. The next step gives the equation 

xv^ - 12389 M^^ + 42226610 -1=0. 

The 16th roots of 12389, etc., are -1.80225, 1.24676, -0.44504, to which the proper 
signs have now been prefixed (their product being positive and sum being —1). 

Instead of repeating the process, we may now obtain as follows the values of the 
roots correct to five decimal places. We had the logarithms of the last approxi- 
mations to the roots and hence see at once that (xj/x2)^' affects only the 8th decimal 
place and that (xi/xiY^'ia still smaller. The coefficient of u? is Sxi*^", whose 
expression (4) involves only the first three terms. Hence 

Xi"x2^« = 422266, 

correct to 7 decimal places. The reciprocal is Xa*', whence xz = —0.44604 to 
5 decimal places. By the approximate values of Xi and xs from the u>-cubic, 
(xi/xi)i« = 0.002751. Thus 

1.002751 xi" = 12389 = Sxi", 
whence Xi = —1.80194 to 5 decimal places. By the displayed equations, 

„ 422266 X 1.002751 

X2" = •■ — , xj = 1.24698. 

^ 12389 ' x..«uj7o. 

We have now found each root correct to five decimal places. As a check, note 
that the roots are (Ch. VIII, § 3, § 8) 

^ 2t - 4t _ 6t 
2 cos---, 2 cos-—, 2cos— -• 

7 7 7 

The above process requires modification if several of the largest roots 
are equal or approximately equal numerically. If Xi and x^ are approxi- 
mately equal, but sufficiently different from Xs, . . . , Xn, numerically, ^ui 
approximate value of Xi*^ is i 2xi*^. 

Next, consider a cubic equation with two conjugate imaginary roots 

• We may use symmetric functions: Syi = Sxi' = (Sxi)' — 2 SxiXt — 5, etc. 
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Xi and Xzy whose modulus (Ch. II, § 8) is r, and a^real root Xi numeri- 
cally greater than r. Then the real number 

is numerically less than or equal to the. sum 



^ / mod. X2 V 



of the moduli of its two parts, and hence approaches zero as m increases. 
Thus, by (3), an approximate value of Xi** is Sxi**. 

Example 2. For x> - 2 x - 2=0, xi > 1.7, xjXj = r» = 2/xi. Since 2 < (1.7)', 
r < 1.7 < Xi. Forming the equation whose roots are the squares of the roots of 
the x-cubic, that whose roots are the fourth powers, etc., we get 

2/'-4y* + 42/- 4 = 0, 

2»-82'-16«-16 = 0, 

t;3. 96^2.256 =0. 
Thus Xi is approximately 

\/96 = 1.7692 .... 
By two more steps, we get 

xi = ^^85032960 = 1.769293, 

correct to six decimal places. 

* 

For a cubic equation in which Xi < r, we employ the equation in X 
obtained by setting x = l/X, Its root 1/xi exceeds numerically the mod- 
ulus 1/r of the imaginary roots l/x2, l/xj. Hence the equation in X is of 
the type last discussed. 

EXERCISES t 

1. The equation whose roots are the 8th powers of the roots xi, Xs, xs of 
x»-4x»-x + 3 = is 

«;» - 74474 u;« + 46213 u? - 6561 = 0. 

Dividing the negative of each coefficient by the preceding coefficient and extracting 
the 8th root of each quotient, we get 4.06443, 0.94, 0.78. The first is a good 
approximation to Xi. The last two are approximately equal and hence not good 
approximations to — xs, xs. To avoid this inconvenience, add unity to each root 
(i.e., replace x by X — 1). Treat the equation in X and so obtain good approxi- 
mations to Xi, Xs, Xt. 



I' 
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Treat by the present methods 

2. a:»-2z-5 = 0. 3. a:» - 2^^ - 2 = 0. 4. a:» + 4x* - 7 = 0. 

5. a:» + 2x + 20 = 0. 

For further details on the determination of imaginary roots by this method 
see Encke, CreUe'a Journal^ vol. 22 (1841), p. 193, and examples by G. Bauer 
Varlesungen vJber Algebra, 1903, p. 244; and C. Runge, Praxis der OUichungen 
1900, p. 157. 

9.t To determine the imaginary roots of an equation f(z) = witl 
real coefficients, expand /(x + yi) by Taylor's Theorem; we get 

fix) +f'{x) yi - f"(x) ^ - f"'{x) j^ + . . . =0. 
Since x and y are to be real, and y^O, rs, -jv*) »• A • y I 



(6) 



/'(x)-/"'(x)j^+/W(x)|j 0. 



By eliminating y^ between these two equations, we obtain an equation 
E{x) = 0, whose real roots x may be found by one of the preceding 
methods. In general the next to the final step of the elimination giveg 
2^^ as a rational function of x, so that each real x which yields a positive 
real value of y^ furnishes a pair of imaginary roots x =t yi of f{z) = 0. But 
if there are several pairs of imaginary roots with the same real part x, the 
equation in y^ used in the final step of the elimination will be of degree 
greater than unity in y^. 

Example. For f{z) = 2* — z + 1, equations (6) are 

x*-x+l -6xV + 2/^ = 0, 4x5- 1 -4xy« = 0. 
Thus 

|/* = x«-— , -4x« + x« + jg = 0. 

The cubic equation in x^ has the single real root 

3^ = 0.528727, x = dbO.72714. 
Then y* = 0.87254 or 0.184912, and 

g^x + yi = 0.72714 ± 0.43001 1, - 0.72714 ± 0.93409 %. 
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EXERCISESt 

1. For the quartic equation in Ch. V, § 1, eliminate y^ between equations X = 0, 
y = 0, corresponding to the present pair (6), and get 

x{x - 2)(16a:* - 64x3 + 136^.2 - I44z + 65) = 0. 

Show that the last factor has no real root by setting 2x == w + 2 and obtaining 
(v)^ + l)(v)^ + 9) = 0. Hence find the four sets of real values x, y and hence the 
four complex roots x + yi. 

2. If r and « are any two roots of /(z) = and we set 



X = 



r + s 



y = 



r — 8 



10 1. Lagrange's Method. The root between 1 and 2 of 

x» + 4x2-7 = 



t.'.'-VV'V 



2 ' " 2i ' 

we have r = x + yiy « = x — yi, so that f(x db yi) = 0. Hence E{x) = has 
as its roots the i n{n — 1) half-^ums of the roots o{f(z) = in pairs. If, however, 
we eliminate x between equations (6) and set —4 y* = 10, we obtain an equation 
in w whose roots are the i n{n — 1) squares of the differences of the roots of f{z) = 0. 

may be expressed as a contin led fraction. Set x = 1 + \/y. Then ^ ' 

-22/» + lli/2 + 7t/ + l=:0. 

Since — 2 y^ + 11 y* must be negative, we have 2/ > 5. 
that y lies between 6 and 7. Set 1/ = 6 + \/z. 

-2 11 7 1 [6 

-12 - 6 6 



We find by trial 



-2 


- 1 
-12 


1 

-78 


7 


-2 


-13 
-12 


-77 





-2 



-25 



_2_25_77 

!? Z^ Z^ ' ' 



72»-77z*-25a-2 = 0. 



Since 7 2» - 77 «* > 0, z > 11. The value of z lies between 11 and 12. 
Now 

1 7z + l 



x = l + 



6 + 1 
z 



62+1 



* We may of course first set x » 1 + (2, find the cubic equation in (2 by our earlier 
method, and then replace d by \/y. 
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Using. 2 = 11, we find that x is just smaller than 1.1642. But 2 is in fact 
just greater than 11.3. Using 2 = 11.3, we find that 

x = ^ = 1.1642+. 

Hence X = 1.1642 to four decimal places. 

There is a rapid method of evaluating a continued fraction and a means 
of finding the limits of the error made in stopping the development at a 
given place. For an extensive account of the theory and applications of 
continued fractions, see Serret's Cours d^Algibre SuphieurCy ed. 4, I, 
pp. 7-86, 351-368. 



CHAPTER XI 



(1) 



Determinants; Systems of Linear Equations 

1. In case there is a pair of numbers x and y for which 

aix + 6iy = fci, 
flaic + b%y = fe, 

they may be fomid as follows. Multiply the members of the first equa- 
tion by &2 and those of the second equation by —61, and add the resulting 
equations. We get 

(ai62 ~ (ijbi)x = ^162 ~ kjbu 

Emplo3ring the respective multipliers — Oa and ai, we get 

(ai62 — oJ^\)y = aife — (hhi' 
The common multiplier of x and y is 

(2) ai&2 — 0261, 

which is called a dei^rminant of the second order and denoted by the 
symbol* 

The value of the symbol is obtained by cross-multiplication and substrac- 
tion. Our earlier results now give 



(20 



CLl hi 




fci 61 




Oi 61 




Oi A;i 


02 62 


X — 


h 62 


9 


02 62 


y = 


02 h 



(3) 



We shall call k\ and A^ the known terms of our equations (1). Hence, 
if D is the determinant of the coefficients of the unknowns^ the product of D by 
any one of the unknowns eqv/ds the determinant obtained from D by substi- 
tuting the known terms in place of the coefficients of that unknown. 

* The symbol for an expression should show explicitly all of the quantities upon 
whose values the value of the expression depends. Here these are aiy biy 0%, h%. The 
advantage of writing these in the symbol (2') in the order in which they occur in the 
equations is that the symbol may be written down without an effort of memory by a 
mere inspection of the given equations. 
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Example. For2x — 3y=— 4, 6z — 2y=«2, we have 

14a; = 14, 

= 28, J/ = 2. 



2 


-3 




-4 -3 


6 


-2 


z = 


2 -2 




142/ = 


2 -4 
6 2 



1, 



EXERCISES 

Solve by determinants the systems of equations 

--1. 8x-y = 34, ^2. 3x + 4y = 10, 
x + 8y = 53. 4x + y =9. 



3. ax + hy ^^ a\ 
bx — ay = ab. 



^ 4. Verify that, if the determinant (2) is not zero, the values of x and y deter- 
mined by division from (3) satisfy equations (1). 

2. Consider a system of three linear equations 

aix + biy + Ciz = fci, 

(4) ojx + 622/ + caz = fe, 

a»x + 6ji/ + C82 = fcs. 

Multiply the members of the first, second and third equations by * 

(5) 63C3 — 6jC2, 63C1 — bidy biCi — b^Qi, 

respectively and add the resulting equations. We obtain an equation 
in which the coeflBcients of y and z are found to be zero, while the coeffi- 
cient of X is 

(6) 0162^8 — dibzCi + aJbiCi — OjfciCs + Ch/biC^ — aJb^Ci. 

Such an expression is called a determinant of the third order and denoted 
by the symbol 

tti 61 Ci 
(60 02 62 C2 

as &3 Cz 

The nine numbers ai, . . . , Cs are called the elements of the determi- 
nant. In the symbol these elements lie in three (horizontal) rows, and 
also in three (vertical) columns. Thus 05, 62, C2 are the elements of the 
second row, while the three c's are the elements of the third colunm. 

* A simple rule for finding these multipliers is given in { 3. 
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The equation (free of y and 2), obtained above, is 



Ci bi Ci 




fcl 61 Ci 


(h ^2 C2 


z = 


fe 62 C2 


Os &3 Cs 




A^s &3 Cs 



since the constant member was the sum of the products of the expres- 
sions (5) by fci, ^2, h, and hence may be derived from (6) by replacing 
the a's by the fc's. Thus the theorem of § 1 holds here as regards the 
value of X. 

3. Minors. The determinant of the second order obtained by eras- 
ing (or covering up) the row and column crossing at a given element of a 
determinant of the third order is called the minor of that element. For 
example, in the determinant D given by (6')) the minors of ai, 02, a« are 



Ai = 



62 C2 
bz Cz 



A,= 



bi ci 
bz cz 



ils = 



bi ci 
62 Pa 



respectively. The multipliers (5) are therefore Ai, — Aa, Az. Hence the 
first results obtained in § 2 may be stated as follows: 

(7) D = aiAi — 0/2^2 + (hAzy 

(8) biAi — &2A2 + biAz = 0, CiAi — c^A^ + czAz = 0. 
The minors of 61, 62, bz in this determinant D are 

B\ = dzCz — a«C2, B2 = diCz — dzCi, Bz == (X1C2 — (hCi. 

Multiply the members of the equations (4) by — Bi, B2, —Bzj respectively, 
and add. In the resulting equation, the coefficients of x and z are seen to 
equal zero: 

(9) -aiBi + oiBi - OsBs = 0, -CiBi + C2S2 - CjBs = 0, 
while the coefficient of y is seen to equal the expression (6) : 

(10) D = -61B2 + &2B2 -bzBz. 
Hence the theorem of § I holds here for the variable y. 

The reader should also verify that, if he uses the multipliers Ci, — C2, Ca, 
where Ci is the minor of Ci in D, he obtains an equation in which the co- 
efficients of X and y are zero: 

(11) aiCi - 0202 + azCz = 0, biCi - 62C2 + bzCz = 0, 
while the coefficient of z equals the expression (6) : 

(12) D = CiCi - C2C2 + czCzy 

and then conclude that the theorem of § 1 is true as regards z. 
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4. Eiqiansion According to the Elements of a Column. Relations (7), 
(10), (12) are expressed in words by saying that a determinarU of the third 
order may be expanded according to the elements of any column. To obtain 
the expansion, we multiply each element of the colunm by the minor of 
the element, prefix the proper sign to the products, and add the signed 
products. The signs are alternately + and — , as in the diagram. 

+ - + 
- + - 
+ - + 

6. Two Columns Alike. A determinant * is zero if any two of its 
columns are alike. 
This is evident for a determinant of the second order: 



c c 
dd 



= cd — cd = 0. 



To prove it for a determinant of the third order, we have only to expand 
it according to the elements of the column not one of the like columns 
and to note that each minor is zero, being a determinant of the second 
order with two columns alike. 

EXERCISES 

Solve by determinants the systems of equations (expanding a determinant having 
two zeros in a colunm according to the elements of that coimnn) : 

"^1. x+y+2=ll, "^2. x+2/+2 = 0, 

2x-6y- 2 = 0, x + 22/ + 32=-l, 

3x + 4y + 22 = 0. x + 3y+62 = 0. 

3. Noting that Ai, At, As of § 3 do not involve ai, os, Oi, we may obtain the 
first expression (8) from (7) by replacing each ai by 6i, and the second expression (8) 
from (7) by replacing each a*- by Ci, Hence (8) are the expansions of 

Ci hi ci 

C2 ht Ci =0 

C3 6i ca 

according to the elements of the first column. 

4. Prove similarly that (9) and (11) follow from { 5. 

* Here and in {§ 6-11 we understand by a determinant one of the second or third 
order. After determinants of higher orders have been defined, it will be shown that 
these theorems are true of determinants of any order. 



hi hi Ci 




ht ht Ci 


= 0, 


6i 6i ca 
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6. Theorem. A determinant having Oi + ^i, 02 + ^2,... o« the eU- 
ments of a column equals the sum of the determinant having ai, a^, . . . as 
the elements of the corresponding column and the determinant having gi, qz, 
. . . as the elements of that column, while the elements of the remaining 
columns of each determinant are the same os in the given determinant. 

For determinants of the second order, there are only two cases: 



Oi + Qi 61 
02 + g2 &2 




Oi 61 

02 62 


+ 


gi fci 
g2 62 


61 oi + gi 

62 02 + g2 


= 


61 Oi 

62 02 


+ 


fci gi 
62 g2 





Oi 61 Ci 




gi bi ci 


=: 


02 62 C2 


+ 


g2 62 C2 




a^ bz Cz 




gs 6j C8 



For determinants of the third order, one of the three cases is 

Oi + gi 61 Ci 
02 + g2 &2 C2 
os + ga &8 cs 

To prove the theorem we have only to expand the three determinants 
according to the elements of the column in question (the first colunm in 
the first and third illustrations, the second column in the second illustra- 
tion) and note that the minors are the same for all three determinants. 
Hence Oi + gi is multiplied by the same minor that Oi and gi are multi- 
plied by separately, and similarly for 02 + g2, etc. 



7. Removal of Factors. A common factor of all of the elements of the 
same column of a determinant m>ay he divided out of the elements and placed 
as a factor before the new determinant. 

In other words, if all of the elements of a column are divided by n, the 
value of the determinant is divided by n. For example, 



nai 61 


= n 


Oi 61 


no2 &2 




02 62 



Oi nbi Ci 




Oi 61 Ci 


02 n&2 C2 


= n 


O2 &2 C2 


03 nbi Ct 




Oi bz Ct 



Proof is made by expanding the determinants according to the elements 
of the column in question. 
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8. Theorem. A determinant is not changed in value if we add to the 
elements of any column the 'products of the corresponding elements of another 
column by the sams number. 



For example. 



a\ + nbi h\ 

02 + TI&2 ^2 



02 h% 





Oi 6i Ci 




6i 6i c\ 


=5 


0/2 h% Oi 


+ n 


&2 &2 C2 




03 bs ^ 




h%h% Ci 



as follows from the first relation in § 6. Similarly, by the third, 

Oi + nbi 6i Ci 

02 + TI&2 ^2 C2 

03 + nfta fts Cs 

in which the last determinant is zero by § 5. 

In general, let Oi, 02, . . . be the elements to which we add the products 
of the elements 61, 62, . . . by n. We apply § 6 with q\ = nbi, q% = n&2, .... 
Thus the modified determinant equals the simi of the initial determinant 
and a determinant having bi, &2> . . . in one column and nbi, rtbi, ... in 
another column. But the latter determinant equals (§7) the product 
of n by a determinant with two colunms alike and hence is zero (§ 5). 

Example. Multiplying the elements of the last colunm by 2 and adding the 
products to the elements of the second column, we get 



1 
1 
6 



-2 
2 
4 



1 
3 
3 



1 
1 
6 





8 
10 



1 
3 
3 





-2 

3 





8 

10 



1 




-2 8 


3 


= 


3 10 


3 




• 



= -44. 



For the next step, we have multiplied the elements of the third colimm by — 1 
and added the products to the elements of the first column. Expanding the third 
determinant according to the elements of the third column, we note that two of 
the minors are zero (having a row of zeros), and hence obtain the determinant of 
the second order written above. The last step is simplified by use of § 10. 

9. Interchange of Rows and Columns. A determinant is not altered 

if in its symbol we take as the elements of the first, second, . . . rows the 
elements (in the same order) which formerly appeared in the first, second, . . . 
columns: 



D^ 



Oi 61 

02 &2 

Oi &i Ci 

02 &2 C2 

Os &S Cs 



Oi O2 
61 62 
Oi O2 O3 
61 62 ^3 
Ci C2 C3 



= A. 



i 10. 11] 



DETERMINANTS 



133 



The proof is evident by inspection for the case of determinants of the 
second order. For those of the third order, we expand A and find that its 
six terms are those in the expansion (6) of D. 

10. Expansion According to the Elements of a Row. To prove that 
determinant D, given by (6'), inay be expanded according to the elements of 
any row (say the second *) : 

D = —02^2 + b2B2 — C2C2, 



with the same rule of signs as in § 4, we note that (§9) 



2) = A= -02 



61 63 

Ci Cz 



+ 62 



Oi Os 



- C2 



fli Os 

61 h 



since A can be expanded according to the elements of its second column. 
After interchanging the rows and columns in these three determinants of 
the second order, we have the minors A2, B2, C2 of 02, 62, C2 in D. 

Example. The third determinant in the Example of § 8 is best evaluated by 
expanding it according to the elements of its first row, since two of its elements are 
zero. Indeed, we obtain +1 multiplied by its minor. 

11. Theorem. A determinant is not changed in value if we add to the 
elements of any row the products of the corresponding elements of another 
row by the same number. 

We shall show that D, given by (6'), equals 

Oi 61 Ci 

D'= a2 + nai 62 + ^1 C2 + nci 
az bz Cz 

Now D = A, where A is given in § 9. By § 8, 

fli a>2 -\- nai az 

A = 61 62 + w6i bz 

Ci C2 + nci Cz 

Interchanging the rows and columns of A, we get D\ Hence 

D' = A = D. 

* While for concreteness we have here (and in § 11) treated but one of several cases, 
the proof is such that it applies to all the cases. 
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EXERCISES 

1. Evaluate the numerical determinant in { 8 by removing the factor 2 from 
the second colunm and then getting a determinant with two zeros in the second 
row. 

Solve the systems of equations (by removing, if possible, integral factors from a 
colunm and reducing each determinant to one with t^o zeros in a row before 
expanding it) : 

"-2. x-2y+ 2=12, 3. 3z-22/ = 7, 

z + 22/ + 3z = 48, 32/-2z = 6, 

6z + 4y + 3« = 84. 3z-2x=~l. 

Factor a single determinant, and solve 

^4. x+ y+ z = lf ^5. ax+by+cs^k, 

ax + by + cz == k, a^x + b^ + ch = k^, 

aht + ly^ + chi'=k\ a*x + b^ + c^ = k^. 

^6. Obtain in its simplest form the value of x from 

ac+ y + 2 = a — 3, 
x + ay+ 2 = -2, 
x+ y + az = —2. 

7. Deduce the case n = 2 of § 7 at once from { 6, by taking Qi = a<. 

8. Give the proof in § 10 when the third row is used. 

9. Give the proof in § 11 for a new case. 

10. A determinant of the third order b zero if two rows are alike. 

11. Hence prove that Z>' = Z> in § 11 by expanding D' according to the dements 
of its second row. 

12. Prove the theorem about rows corresponding to that in § 6. 

13. From Ex. 12 deduce Ex; 11. 

12. Definition of a Determinant of Order n. In the six terms of the 
expression (6), which was defined to be the general determinant of order 3, 
the letters a, 6, c were always written in this sequence, while the sub- 
scripts are the six possible arrangements of the numbers 1, 2, 3. The first 
term 0162^3 shall be called the diagonal term* since it is the product of the 
elements in the main diagonal running from the upper left hand comer to 
the lower right hand comer of the symbol for the determinant. The 
subscripts in the term —aib^ are derived from those of the diagonal 
term by interchanging 2 and 3, and the minus sign is to be associated 
with the fact that an odd number (here one) of interchanges of subscripts 
were used. To obtain the arrangement 2, 3, 1 of the subscripts in the 

* Sometimes called the leading term. 
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term +026301 from the natural order 1, 2, 3 (in the diagonal term), we 
may first interchange 1 and 2, obtaining 2, 1, 3 and then interchange 
1 and 3; an even nimiber (two) of interchanges of subscripts were used 
and the sign of the term is plus. 

EXERCISES 

1. Show that a like result holds for the last three terms of (6). 

2. Discuss similarly the two terms of a determinant of order 2. 

While the arrangement 1, 3, 2 was obtained from 1, 2, 3 by one inter- 
change (2, 3), we may obtain it by applying in succession the three inter- 
changes (1, 2), (1, 3), (1, 2), and in many new ways. To show that the 
number of interchanges which will produce the final arrangement 1, 3, 2 
is odd in every case, note that any interchange (the possible ones being 
the three just listed) changes the sign of the product 

P = (Xi - X^{Xi - Xz)(pC2 - X3), 

where the x's are arbitrary variables. Thus a succession of k interchanges 
yields P or —P according as A: is even or odd. Starting with the arrange- 
ment 1, 2, 3 and applying k successive interchanges, suppose that we 
obtain the final arrangement 1, 3, 2. But if in P we replace the subscripts 
1, 2, 3 by 1, 3, 2, respectively, i.e., if we interchange 2 and 3, we obtain 
—P. Hence A: is odd. 

Consider the corresponding question for n variables. Form the prod- 
uct of all of the dififerences Xi — Xy (i < j) of the variables: 

P = (Xi - X2)(Xi - Xs) . . . (Xi — Xn) 
• (X2 — Xs) . . . (X2 — Xn) 



• (X,^i — Xn). 

Interchange any two subscripts i and j. The factors which involve neither 
i nor j are unaltered. The factor =t(Xi — x,) involving both is changed 
in sign. The remaining factors may be paired to form the products 

=b(Xi - Xifc)(xy - x*) (fc = 1, . . . , n; fc ?£ i, ky^j). 

Such a product is unaltered. Hence P is changed in sign. 

Suppose that an arrangement ii, 12, . . . , in can be obtained from 
1, 2, . . . , n by using a successive interchanges and also by 6 successive 
interchanges. Make these interchanges on the subscripts in P; the 
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resulting functions equal ( — 1)"^ and ( — 1)*P, respectively. But the 
resulting functions are identical since either can be obtained at one step 
from P by replacing the subscript 1 by ii, 2 by 12, . . . , n by in. Hence 

SO that a and b are both even or both odd. 
We define a determinant of order 4 to be 



(13) 



Oi 61 Ci di 

02 h2 C2 (h 

az bz Cz dz 

a^ 64 C4 di 



= 2 =^ ^Q^rCsdtf 

(24) 



where q, r, s, t is any one of the 24 arrangements of 1, 2, 3, 4, and the 
sign of the corresponding term is + or — according as an even or odd 
number of interchanges are needed to derive this arrangement q, r, 8, t 
from 1, 2, 3, 4. Although different numbers of interchanges will produce 
the same arrangement g, r, s, t from 1, 2, 3, 4, these numbers are all even 
or all odd, as just proved, so that the sign is fully determined. 

We have seen that the analogous definitions of determinants of orders 
2 and 3 lead to our earlier expressions (2) and (6). 

We will have no difficulty in extending the definition to a determinant 
of general order n as soon as we decide upon a proper notation for the n* 
elements. The subscripts 1, 2, . . . , n may be used as before to specify 
the rows. But the alphabet does not contain n letters with which to 
specify the columns. The use of e', e", . . . , e^**) for this purpose would 
conflict with the notation for derivatives and besides be very awkward 
when exponents are used. It is customary in mathematical journals and 
scientific books (a custom not always followed in introductory text books, 
to the distinct disadvantage of the reader) to denote the n letters used to 
distinguish the n columns by 61, 62, ... , Cn (or some other letter with 
the same subscripts) and to prefix (but see § 13) such a subscript by 
the subscript indicating the row. The symbol for the determinant is 
therefore 



(14) 



D = 



Cii 612 ... 61 



621 622 



Ci 



^nl ^n2 ... 6 



nn 
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By definition * this shall mean the sum of the n! terms 
(14') (-l)'e,-,iei,2 . . . e»> 

in which ii, 12, . . . , in is an arrangement of 1, 2, . . . , n, derived from 
1, 2, . . . , n by i interchanges. For example, if we take n = 4 and 
write Gj, bj, Cy, d, for Cyi, 6/2, 6/3, e^A, the symbol (14) becomes (13) and the 
general term (14') becomes (— 1)* a»j 6», c,, d,^, the general term of the second 
member of (13). 

EXERCISES 

1. Give the six terms involving oj in the determinant (13). 

2. What are the signs of 0365^2^164, aJbACtd^i in a determinant of order five? 

3. The arrangement 4, 1, 3, 2 may be obtained from 1, 2, 3, 4 by use of the two 
successive interchanges (1, 4), (1, 2), and also by use of the four successive inter- 
changes (1, 4), (1, 3), (1, 2), (2, 3). 

4. Write out the six terms of (14) for n = 3, rearrange the factors of each term 
so that the new first subscripts shall be in the order 1, 2, 3, and verify that the 
resulting six terms are those of the expansion of D' in § 13 for n = 3. 

13. Interchange of Rows and Columns. Determinant (14) equals 



D' = 



en 621 .. . 6ni 

612 622 • . . Cn2 



^In ^n • • • ^ 



nn 



Without altering (14'), we may rearrange its factors so that the first 
subscripts shall appear in the order 1, 2, . . . , n, and get 

{ — \ye\kfi2kt . . . Cnk^* 

Since this can be done by i interchanges of the letters e (corresponding to 
the i interchanges by which the first subscripts ii, . . . , in were derived 
from 1, . . . , n), the new second subscripts fci, . . . , fcn are derived from 
the old second subscripts 1, . . . , n by i interchanges. The resulting 
signed product is therefore a term of D'. Hence D = D'. 

* We may define a determinant of order n by mathematical induction from n — 1 
to n, using the first equation in § 17. The next step would be to prove that the present 
definition holds as a theorem. ' 
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14. Interchange of Two Columns. A determinarU is changed in sign 
by the interchange of any two of its columns. 

Let A be the determinant derived from (14) by the interchange of the 
rth and sth colmnns. The expansion of A is therefore obtained from 
that of D by interchanging r and s in the series of second subscripts of 
each term (14') of D. Interchange the rth and sth letters e to restore 
the second subscripts to their natural order. Since the first subscripts 
have undergone an interchange, the negative of any term of A is a term 
of D, and A = -D. 

16. Interchange of Two Rows. A determinant D is changed in sign 
by the interchange of any two rows. 

Let A be the determinant obtained from D by interchanging the rth 
and sth rows. By interchanging the rows and columns in D and in A, we 
get two determinants D' and A', either of which may be derived from the 
other by the interchange of the rth and sth columns. Hence, by §§ 13, 14, 

A = A' = -D' = -D. 

16. Two Rows or Two Columns Alike. A determinant is zero if any 
two of its rows or any two of its columns are alike. 

For, by the interchange of the two like rows or two like columns, the 
determinant is evidently unaltered, and yet must change in sign by §§ 14, 
15. Hence D = -D, D = 0. 

17. Expansion. A determinant can be expanded according to the efe- 
ments of any row or any column. 

Let Eij be the minor of e,/ in D, given by (14). Thus -B,-,- is the deter- 
minant of order n — 1 obtained by erasing the ith row and the jth colunm 
(crossing at e,,). We first prove that 

D = 611^11 -ejifiji + e^iEn - . . . + (-l)»-ieniSni, 

so that D can be expanded according to the elements of its first column. 
The terms of D with the factor en are of the form 

where 12, . . . , in is an arrangement of 2, . . . , n derived from the latter 
by i interchanges. Removing from each term the factor en, and adding 
the quotients, we obtain the (n — 1)! properly signed terms of Eu- 
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Let A be the determinant obtained from D by interchanging the first 
and second rows. As just proved, the total coefficient of 621 in A is the 
minor 

612 618 ... ein 

€32 €zi ... 63n 



^n2 ^n3 • • • ^nn 



of 621 in A. Now this minor is identical with -B21. But A = — D (§ 15). 
Hence the total coefficient of 621 in D equals —£21. 

Similarly, the coefficient of e^i is Ezu etc. 

To obtain the expansion of D according to the elements of its Arth col- 
umn, where fc > 1, we consider the determinant 8 derived from D by 
moving the fcth column over the earlier columns until it becomes the new 
first colimm. 

Since this may be done by fc — 1 interchanges of adj&cent columns, 
5 = (— 1)*~^D. The minors of the elements eu, . . . , 6njb in the first 
column of 8 are evidently the minors £u, . . . , Enk of eu, . . . , 6nib in D. 
Hence, by the earlier result, 

n 

(15) D = Xi-iy-^'CikEiit (fc = 1, . . . , n). 

Applying this result to the equal determinant D' of § 13, and changing 
the summation index from j to k, we get 

n 

(16) D=X (- l)^*ey»^;* 0' = 1, • ■. . , n). 

i-1 

This gives the expansion of D according to the elements of the jth row. 
One decided advantage of the double subscript notation is the resulting 
simplicity of the last two expansions. Of course the sign may also be 
found by counting spaces as in § 4. 

18. The theorems in §§ 6-8, 11 now follow for determinants of order n. 
Indeed, the proofs were so worded that they now apply, since the auxiliary 
theorems used have been extended (§§ 13, 16, 17) to determinants of 
order n. 
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EXERCISES 
1. Prove the theorem of § 15 by the direct method of § 14. 



2. 



h + c c + a a +b 
hi + Ci Ci + ai ai + hi 
ht + Ct Ci + Ot Oi + bi 



= 2 



a b c 
ai bi Ci 
Ot bi Ci 



By reducing to a determinant of order 3, etc., prove that 



\ 



3. 



5. 



a 



a" 



a^ 



2 
1 
3 
4 

b 

&» 
6* 



-1 3 

7 1 

.5 -5 

-3 2 

d 



-2 

-1 

3 

-1 



4. 



= -42. 



1 
1 
1 
1 



1 
2 
3 
4 



1 1 

3 4 

6 10 

10 20 



= 1. 



c 






= abcd{a - 6)(a - c)ia - d)(6 - c)(6 - d)(c - d). 



--6. 



7. 



ae + 6^ af+hh 
ce + dg cf + dh 



a b 
c d 



e f 
g h 



[use § 6]. 



ai 


bi 


Ci 




(H 


6i 


Ci 


• 


(h 


bt 


cz 





aiCi + biCt + Cid aji + bjz + Cj/s aiQi + biQt + CiQz 
ojei + btst + Cifii (hfi + 62/2 + C2/3 02^1 + 62^2 + CiQi 
aiei + 6562 + CzCi asfi + 63/2 + Cifi c^i + bzgt + Cigi 

ei /i Qi 
ei fi 92 
ez fz Qz 

Write out only the 6 of the 27 determinants (§ 6) which are not necessarily zero. 

8. Hence verify that the product of two determinants of the same order (2 or 3) 
is a determinant of like order in which the element of the rth row and cth column 
is the sum of the products of the elements of the rth row of the first determinant 
by the corresponding elements of the cth column of the second. 

9. Express (a» + 6* + c» + cP)(e2 + P + g^ + h^) bs a sum of 4 squares by 
writing 

« + A* 9 -\- hi 



a-{-bi c + di 
—c -{• di a — 61 



—9 + hi e — fi 



as a determinant of order 2 similar to each factor. 
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10. If 8i = a» + /3* + 7«, 

1 1 



1 a a> 




1 fi 0' 


z= 


1 7 7* 





3 8i 82 
Si 8t S3 
Si S3 S4 



11. Using the Factor Theorem and the diagonal term, prove Ex. 5 and 



1 1 ... 1 

XX X] ... x^ 
Xi X2 ... Xi) 



^^n-l^jfi-l ^ ^ Xn**-^ 



n 



n(n-l) 



= n(^i-^y) = (-i) ' ^, 



* * « 



where P is given in § 12. 

12. With the notations of § 3, and using (7)-(12), prove that 

D 
D 
D 

Hence the first determinant equals D^. 

19. Complementary Minors. The determinant D of order 4 in (13) 
is said to have the two-rowed complementary minors 



Ax -At 


A, 




-B, B, 


-Bt 


• 


C\ —Ci 


c, 





ai 


h 


Ci 




Oj 


62 


Cj 


= 


az 


&8 


C3 





M = 



03 hz 



ilf' = 



C2 (fc 
C4 d\ 



since either is obtained by erasing from D all the rows and colimms having 
an element occurring in the other. Similarly, any r-rowed minor of a 
determinant of order n has a definite complementary (n — r)-rowed 
minor. In particular, any element is regarded as a one-rowed minor 
and is complementary to its minor. 

20. Laplace's Development. Any determinant D equals the sum of 
all the products ± MM', where M is an r-rowed minor hamng its elements 
in the first r columns of D, and Af ' is the minor complementary to M, while 
the sign is + or — axxording as an even or odd number of interchanges of 
rows of D will bring M into the position occupied by the minor Af 1 whose 
elements lie in the first r rows and first r columns of D. 
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For r =s ly this development becomes the known expansion of D according to 
the elements of the first colunm (§ 17) ; here Af i = 611. 
If r = 2 and D is the determinant (13) of order 4, 



D = 


ai 61 
(h bi 


• 


Cz dz 
Ci di 


— 


ai 61 
Oz bz 


• 


Ci di 
Ci di 


+ 


ai 61 
Oi bi 


• 


Ci di 
Cz dz 


+ 


Oi bi 
Oz bz 


• 


ci di 
Ci di 


— 


(h bi 
Oi bi 


• 


Ci di 
Cz dz 


+ 


Oz bz 
Oi bi 


• 


ci di 
Ci di 



The first term of the development is M lilf /; the second term is — ilf ilf '(in the nota- 
tions of § 19), and the sign is minus since the interchange of the second and third 
rows of D brings this M into the position of Mi. The sign of the third term of the 
development is plus since two interchanges of rows of D bring the first factor 
into the position of Mi. 



If D is the detenninant (14), then 



Mi = 



fill . . . 61, 



en 



• • C/f 



Mi' = 



^r+lr+l . • . 6r+in 



€n r+1 • • • 6 



n n 



Any term of the product Af lAf / is of the type 

(— l)*e„iej^ . . . 6.V* ("l)'^<r+iH-i . . . e 



»»»> 



where ii, . . . , v is an arrangement of 1, . . . , r derived from 1, . . . , r 
by I interchanges, while v+i, . . . , in is an arrangement of r + 1, . . . , n 
derived by j interchanges. Hence ii, . . . , in is an arrangement of 
1, . . . , n derived by i + j interchanges, so that the above product is a 
term of D with the proper sign. 

It now follows from § 15 that any term of any of the products dt MM' 
of the theorem is a term of D. Clearly we do not obtain in this manner 
the same term of D twice. 

Conversely, any term t of D occurs in one of the products dt MM'. 
Indeed, i contains as factors r elements from the first r colunms of Z), 
no two being in the same row, and the product of these is, except per- 
haps as to sign, a term of some minor M . Thus Ms a term of MM' or 
of ~MM'. In view of the earlier discussion, the sign of i is that of the 
corresponding term in d: MM\ where the latter sign is given by the 
theorem. 
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21. There is a Laplace development of D in which the r-rowed minors 
M have their elements in the first r rows of D, instead of in the first r 
^jjolumns as in § 20. To prove this, we have only to apply § 20 to the equal 
determinant obtained by interchanging the rows and columns of D. 

There are more general (but less used) Laplace developments in which 
the r-rowed minors M have their elements in any chosen r columns (or 
rows) of D. It is simpler to apply the earlier developments to the de- 
terminant it D having the elements of the chosen r colunms (or rows) 
in the new first r columns (or rows). 



BXERCISES 



L 



abed 










e f g h 




a b 




3 k 


j k 




e f 




I m 


I m 











2. 

1 
2 



a 
e 
a 
e 



b c d 

f g h 

bed 

f g h 





a b 




• 

c d 




a c 




b d 




a d 




6 c 


^ 




• 




^ 




• 




+ 




• 






e f 




9 h 




e g 




f h 




e h 




/ 9 



= 0. 



3. Check § 20 by showing that the total number of products of n elements is 
Cr" • r!(n — r)I = nl, where C/^ is the number of combinatioDs of n things r at a 
time. 

For Laplace's development of many special determinants, see Ch. XII. 

• 

22. Product of Determinants. The important rule (Ex. 8, p. 140), 
for expressing the product of two determinants of order n as a determi- 
nant of order n is found and proved easily by means of Laplace's develop- 
ment. For brevity we shall take n = 3, but the method is seen to apply 
for any n. We have 



Oi 6i Ci 

02 &2 ^2 

Oi bz Cz 



ei /i 


Qi 




et ft 


g* 


^ 


e» ft 


gt 





Oi 


bi 


Cl 











<h 


bt 


Ci 











at 


bt 


Ct 











1 








«i 


/i 


ffi 





-1 





et 


/. 


gt 








-1 


e* 


ft 


g* 
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In the determinant of order 6, multiply the elements of the first colunm 
by ei, /i, gi in turn and add the products to the corresponding elements of 
the fourth, fifth and sixth columns, respectively (and hence introduce 
zeros in place of the present elements ci, /i, gi). Then multiply the ele- 
ments of the second column by e2, /z, ^2 in turn and add the products to 
the corresponding elements of the fourth, fifth and sixth columns, re- 
spectively. Finally, multiply the elements of the third column by ea, /s, gt 
in turn and add as before. The new determinant is 

aiei+bie2+Ciez ai/i+fei/z+Ci/s ai^i+6i^2+Ci^8 

<hfii+h7fi2+C7fii 02/1+62/2+02/3 (hgi+htgi+c^ 

a^ei+hzet+c^z 03/1+68/2+03/3 a^i+h^i+c^t 







By Laplace's development (or by expansion according to the elements of 
the last row, etc.), this equals the 3-rowed minor whose elements are the 
long sums, and written in Ex. 7, p. 140. 

23. Systems of Linear Equations. In the n equations 

OiiXi + 012X2 + • • • + dlnXn = fcl, 

(17) 

OnlXi + On2X2 + • • • + annXn = *„, 

let D denote the determinant of the coefficients of the n unknowns: 



ai 


61 


Cl 


at 


ht 


Ct 


a» 


h 


Cs 


1 











-1 











-1 



D = 



On O12 . . . Oin 



Onl On2 • • • CLnn 

Let Aij be the minor of a^ in D. Multiply the members of the first equa- 
tion by All, those of the second equation by — A21, . . . , those of the nth 
equation by ( — l)'*"*ilni, and add. The coefficient of Xi is the expansion 
of D according to the elements of its first column. The coefficient of Zz 
is the expansion, according to the elements of the first colunm, of a deter- 
minant derived from D by replacing On by O12, . . . , o^i by o,»2, so that 
this determinant has the first two columns alike and hence is zero. In 
this manner, we find that 

(18) Dxi = Ku Dxt = Kt, . . . , Dzn=^ Kn, 
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in which (see ()3) of § 24) Ki is derived from D by substituting fci, . . . , fcn 
for the elements ai„ . . . , a^i of the ith column of D. Another proof of 
(18) follows from 



Dzi = 



(ZiiXi ai2 . . .Clin 



dnlXl (Zn2 • • • CI 



nn 



aiiXi + 



+ dlnXn . . . ai, 



an lXi+ 



I ^nn^^n • . • (I 



nn 



=Ki. 



We have now extended to any n results proved for n = 2 and n = 3 
in §§ 1-3. 

If D 9^ Oj the unique values of Xi, . . . , Xn determined by division 
from (18) actually satisfy equations (17). For instance, the first equation 
is satisfied since 

Ki CLii CL]2 • • • Clin 



kiD — GnKi — aviKi — 



— ainKn = 



k\ dw {ii2 • . • (Zii 

hi 021 022 ... 02 1 



dnl fln2 ... a 



nn 



as shown by expansion according to the elements of the first row; and 
the determinant is zero, having two rows alike. 



24. Rank of a Determinant. If a determinant D of order n is not 
zero, it is said to be of rank n. In general, if some r-rowed minor of D is 
not zero, while every (r + l)-rowed minor is zero, D is said to be of 
rank r. 

For example, a determinant D of order 3 is of rank 3 if D 5^ 0; of rank 2 if D = 0, 
but some two-rowed minor is not zero; it is of rank 1 if every two-rowed minor 
is zero, but some element is not zero. It is said to be of rank if every element is 
zero. 

In the discussion of the three equations (4), five cases arise: 

(a) D of rank 3, I.e., D 5^ 0. 

(fi) D of rank 2 (i.e., D = 0, but some two-rowed minor 5^ 0), and 



^1 = 



k\ h\ c\ 
kt ht Ct 
kt bi Ct 



Kt^ 



ai ki Ci 
(h ki Ci 
(h kt Cz 



Kt 



Qi hi ki 
oi ht k% 

08 &8 A^ 



not all zero. 



a. 


ki 


1 


bi 


ki 


a 


Ci 


ki 


a,- 


hi 


bi 


ki 


/ 


C; 


ki 
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(7) D of rank 2 and /Ti, X2, i^s all zero. 

(5) D of rank 1 (i.e., every two-rowed .minor = 0, but some element ^ 0), and 

(i, j chosen from 1, 2, 3) 

not all zero; there are mne such determinants K, 

(c) D of rank 1, and all nine of the determinants K zero. 

In case (a) the equations have a single set of solutions (§ 23). In cases (fi) and 
(a) there is no set of solutions. In case (7) one of the equations is a linear com- 
bination of the other two; for example, if aJh — ajbi 5^ 0, the first two equations 
determine x and y as linear functions of z (as shown by transposing the terms in z 
and solving the resulting equations for x and t/), and the resulting values of x and 
y satisfy the third equation identically as to z. Finally, in case («), two of the 
equations are obtained by multipljring the remaining one by constants. For 
(fi) the proof follows from (18). For (7), («), (e), the proof is given in § 25. 

The reader acquainted with the elements of solid analytic geometry will see 
that the planes represented by the three equations have the following relations: 

(a) The 3 planes intersect in a single point. 

(ff) Two of the planes intersect in a line parallel to the third plane. 

(7) The 3 planes intersect in a conmion line. 

(«) The 3 planes are parallel and not all coincident. 

(e) The 3 planes coincide. 

26. Fundamental Theorem. Let the determinant D of the coeficients 
of the unknowns in equaiioTis (17) he of rank r, r < n. If the determinants 
K obtained from the (r + l)'rowed minors of D by replacing the elements 
of any column by the corresponding known terms ki are not aU zero, the equa- 
tions are inconsistent. Bui if these determinants K are all zero, the r equor' 
tions involving the elements of a non-vanishing r-rowed minor of D determine 
uniquely r of the variables as linear functions of the remaining n — r vari' 
ables, and the expressions for these r variables satisfy also the remaining 
n — r equations. 

For example, let r = n — 1. Then D = and the K^s are the Ki, . . . , 
Kn of § 23. Hence, by (18), the equations are inconsistent unless 
Ki, . . . , Kn are all zero. This affords an illustration of the following 

Lemma 1. If every (r + l)-rowed mmor M formed from certain 
r + I rows of D is zero, the corresponding r + 1 equations (17) are incon- 
sistent if there is a non-vanishing determinant K formed from any M 
by replacing the elements of any colunm by the corresponding known 
terms fc,-. 
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For concreteness,* let the rows in question be the first r + 1 and let 

du . • . tZif ki 



K= 



9-^0. 



Let di, . . . , dr+i be the minors of fci, . . . , kr+i in K. Multiply the 
first r + 1 equations (17) by di, — cfe, • • • i ( — l)''dr+i, respectively, and 
add. The right member of the resulting equation ia :h K. The coeflS- 
cient of x« is 

(111 • . . dir CXi g 



dr+ll • • . dr+lr Ctr+l • 

and is zero, being an M. Hence = iLK. 

Lemma 2. If all of the determinants M and K in Lemma 1 are zero, 
but an r-rowed minor of an M is not zero, one of the corresponding r + 1 
equations is a linear combination of the remaining r equations. 

As before let the r + I rows in question be the first r + 1. Let the 
non-vanishing r-rowed minor be 

Oil . . . Oir 



(19) 



dr+l = 



drl 



a 



rr 



5^0. 



Let the functions obtained by transposing the terms fc< in (17) be 



Li = a.iXi + ai2Xi + 



+ dinXn — ki. 



By the multiplication made in the proof of Lemma 1, 

diLi - dzLs + • • • + (-l)''dr+iLr4.i = =F i?: = 0. 
Hence Lr+i is a linear combination of Li, . . . , Lr. 

The first part of the fundamental theorem is true by Lenmia 1. The 
second part is readily proved by means of Lemma 2. Let (19) be the 
non-vanishing r-rowed minor of D. For s > r, the sth equation is a 
linear combination of the first r equations, and hence is satisfied by any 
set of solutions of the latter. In the latter transpose the terms involving 
Xr+iy . . . , Xn. Since the determinant of the coefficients of Xi, . . . , Xr 
is not zero, § 23 shows that Xi, . . . , Xr are uniquely determined linear 
functions of Xr+i, . . . , Xn (which enter from the new right members). 

* All other cases may be reduced to this one by rearranging the n equations and 
relabelling the unknowns (replacing Xa by the new Xi, for example). 
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EXERCISES 

1. Write out the proof of the theorem in § 25 for the cases (7), («), («) in § 24. 
Discuss the following systems of equations: 

^2. 2x+ y + 3z=h 3. 2x+ t/+3z=l, 

4x + 2y- z=-3, 4x + 2y- z = 3, 

2x+ y — ^z=—^, 2x-h y — 4z = 4. 

4. a; — 32/+ 4z = 1, 5. a; — 32/ + 4z = 1, 

4x- 122/ + 16z = 3, 4x- 12 2/ + 16z = 4, 

3x- 92/ + 12z = 3. 3x- 92/ + 12z = 3. 

6. Discuss the equations in Exs. 4 and 5, p. 134, when two or more of the num- 
bers a, 6, c, k are equal. 

7. Discuss the equations in Ex. 6, p. 134, when a = — 2. 

26. Homogeneous Linear Equations. When the known terms ki, . . . , 
kn in (17) are all zero, the equations are called homogeneous. The determi- 
nants K are now all zero, so that the n homogeneous equations are never 
inconsistent. This is also evident from the fact that they have the set of 
solutions Xi = 0, . . . , Xn = 0. By (18), there is no further set of solu- 
tions if D 5»^ 0. If D = 0, there are further sets of solutions: if D is of 
rank r, there occur n — r arbitrary parameters in the general set of solu- 
tions (§ 25). A particular case of this result is the much used theorem: 

A necessary and sufficient condition that n linear homogeneous equaliona 
in n unknowns shall have a set of solutionSy other than the trivial one in which 
each unknown is zero, is that the determinant of the coeffi/nents be zero. 

27. The case of a system of fewer than n linear equations in n un- 
knowTis may be treated by means of the Lemmas in § 25. 

In case we have a system of more than n linear equations in n unknowns, 
we may first discuss n of the equations. If these are inconsistent, the 
entire system is. If they are consistent, the general set S of solutions may 
be found and substituted into the remaining equations. There result 
conditions on the parameters occurring in S, and these linear conditions 
may be treated in the usual manner. Ultimately we get either the gen- 
eral set of solutions of the entire system of equations or find that they are 
inconsistent. To decide in advance which of these cases will arise we have 
only to find the maximum order r of a non-vanishing r-rowed determinant 
formed from the coefficients of the unknowns, taken in the regular order 
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in which they occur in the equations, and ascertain whether or not the 
(r + l)-rowed determinants /f, formed as in § 25, are all zero.* 

28. An important case is that of n non-homogeneous linear equations 
in n — 1 unknowns Xi, . . . , Xn-i. By multiplying the known terms by 
Xn = 1, we bring this case under that of n homogeneous linear equations 
in n unknowns (§ 26). Then (18) gives Dxn = 0, D = 0, so that the given 
equations are inconsistent if D 5*^ 0. 

There is no set of solutions of the n equations 

dnXi + anX2 + . . . + ain-lXn-l = fci, 



On 1^1+ an2a^ + 



+ ttn n-lXn-l = fe 



nj 



flu . . . flln-l fcl 
(In I • • • On n— I 'Cfi 



5^0. 



EXERCISES 
Discuss the following systems of equations: 

I. x+ 2/ + 32 = 0, "^2. 2x- 2/+ 42 = 0, 
x + 2y + 2z = Q, x+ Sy - 2z = 0, 

x + 5y— 2=0. X— 11!/ + 14 2 = 0. 



^3. X- 3y+ 42 = 0, 
4x- 12!/ + 162 = 0, 
3x- 92/ + 122 = 0. 



4. 6x + 42/ + 32-84ii; = 0, 
x + 22/ + 32-48w? = 0, 
X-22/+ 2-12ti; = 0, 
4x + 42/— 2 — 24m; = 

6. 2x+ y + Sz = 1, 
4x + 2!/ - 2 = -3, 
2x+ y — 42= —4, 
10x + 52/-62= -10. 



5. 2x+ 32/- 42+ 5w; = 0, 
3x+ 5y - 2+ 2ti; = 0, 
7x + ll!/- 92 + 12ti; = 0, 
3x+ 42/- ll2 + 13w; = 0. 

7. 2x- 2/ + 32 = 2, 
X + 72/+ 2 = 1, 
3X + 52/-52 = -3, 
4x-32/ + 22= 1. 



8. Obtain ' a consistent system of equations from the system in Ex. 7 by replac- 
ing the term —3 by a new value. 

9. In three linear homogeneous equations in x, y, 2, w^ the latter are proportional 
to four determinants of order 3 formed from the coefficients. 



* For an abbreviated statement, the concepts matrix and its rank are needed. Cf, 
B6cher, Introduction to Higher Algebra^ p. 46. 



CHAPTER XII 
Resultants and Discriminants 
1. Introduction. If the two equations 

are simultaneous, i.e., if x has the same value in each, then 

x = = — , fisod — 6c = 0, 

o c ' 

and conversely. Hence a necessary and sufficient condition that th& 
equations have a common root is ft = 0. We call R the remUant (or 
eliminant) of the two equations. 

The result of eliminating x between the two equations might equally 
well have been written in the form fee — od = 0. But the arbitrary 
selection of fi as the resultant, rather than the product of R by some 
constant as —1, is a matter of more importance than apparent at first 
sight. We seek a definite function of the coefficients a, 6, c, d of the funO' 
lions ox + 6, ex + d, and not merely a property ft = or ft ^ of the 
corresponding equations. Accordingly, we shall lay down the definition 
in § 2, which, as the reader may verify, leads to ft m our present example. 

Methods of elimination which seem plausible often yield not ft itself, 
but the product of ft by an extraneous function of the coefficients. This 
point (illustrated in Ex. 3, p. 156) indicates that the subject demands a 
more careful treatment than is often given. 

We may even introduce an extraneous factor zero. Let a 7^ 0, 

/(x)= X^ - 2aX - 3a2, g[x)= X- a. 

From / subtract {x + a)g. Multiply the remainder, — 2a(x + a), by x — 3 a 
and add the product to 2 af. The sum is zero. But the resultant is —4 a* (the 
value of / for X = a) and is not zero. As we used g oaly in the first step and there, 
in eflect, replaced it by x^ — o^, we really found the resultant of the latter and /. 
The extraneous factor introduced (c/. Ex. 7, p. 152) is the resultant of x + a 
and / and this resultant is zero. 

150 
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2. Resultant of Two Polynomials in x. Let 

.-V ( f{x) = 003^+ aix'^^+ • • • + cu (oo 5-^ 0) 

I g{x)= M'* + feix»-i + • • • + fen (bo 9^0) 

be two polynomials of degrees m and n. Let ai, . . . , am be the roots of 
f{x) = 0. Now ai is a root of g{x) = only when g{ai) = 0. The two 
equations have a root in common if and only if the product 

g{ai)g{a2) . . . giom) 

is zero. This symmetric function of the roots of /(x) = is of degree n 
in any one root and hence is expressible as a polynomial of degree n in the 
elementary symmetric fimctions (Chap. VII, §3), which equal — ai/oo, 
02/00, .... To be rid of the denominators oo, it suffices to multiply our 
polynomial by oo'*. We therefore define 

(2) R(Jy g) = (urg{ai)g{pLi) . . . gion,) 

to be the resultant of / and g. It equals a rational integral function of 

Oo, . . . , dmi OOf . . . , On* 

EXERCISES 

- 1. If m = 1, n = 2, R(fj g) = Mi* — feiOoOi + fejOo*. 

-2. n m = 2, n = 1, R(S,g)^ ao(6oai + fei)(feoa2 + fei)= oofei* - Oifeofei + oM, 

since Oo(ai + 02) = — Oi, Ooaiai = O2. 

"^3. If /3i, . . . , /3n are the roots of g{x) = 0, so that 

^(oi)= feo(oi — /3i)(a» — /32) . . . (a» — /3n), 

then 

^(/, ^) = Oo'^feo'" (ai - /3i)(ai - /32) . . . (ai - fin) 

• (02 — /3i)(a2 — /32) . . . (a2 — /3n) 



• {ctm — /3i)(am — /32) . . . (om — fin)- 

Multiplying together the differences in each column, we see that 

4. If m = 2, n = 1, R(gff)= ho^f{—bi/ho)= Oofei* — Oifeofei + Osfeo*, which equals 
R(ft 9) by Ex. 2. This illustrates the final result in Ex. 3. 

5. If rw = n = 2, /2(/, g) = a^Wa^a^ + ao'6o6iaia2(ai + 02) 

+ Oo^bJhiai^ + a2^) + (hWaiai + 00*6162(01 + 02) + 00*62* 

= 60*02* — 6061O1O2 + 6062(01^ — 2 O0O2) + 61*0002 — 6162O0O1 + oo*6f*. 
This equals /2(y,/), since it is unaltered when the o's and 6's are interchanged. 
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6. R 'la homogeneous and of degree n in Oo, ..., Om; homogeneous and of 
degree m in bo, ..,, bn. R has the terms 

8. /2(/,x'») = (-l)'"»o«\ 

3. Irreducibility of the Resultant of Two Polynomials in One Varia- 
ble.* The resultant of two polynomials /(x) and g{x) was seen (§ 2) to 
equal a polynomial r (oq, . . . ,am, 6o, . . . , 6n) in the coefficients of /and ^. 
Let these coefficients be regarded as independent variables. Then r b 
irredudhlej i.e., is not equal to the product of two polynomials n and ri 
in oo, . . . , 6n with numerical coefficients, if neither ri nor r2 is a numerical 
constant.** Suppose that r = rir2. Since r is homogeneous in Oo, . . . , cu, 
each factor r^ is. Likewise, each r* is homogeneous in to, ... , 6*. Hence 

H '^' • • • ' "^'^'V • • • ' W^ A '^' ' • • / 'V^^' '/ • /' 

Replace ai/oo, . . . , a»/ao by the corresponding synmietric functions of 
the roots ai, . . . , a«, also 61/60, • • • , 6„/6o by the corresponding sym- 
metric functions of /3i, . . . , /3n. Let the factors on the right become 
the polynomials Pi and P2 in ai, . . . , /3„. Then (Ex. 3), 

(ai — jSi) . . . (ai — /3n)(Q:2 — /3i) . . . {am — jSn) = P1P2, 

identically in the a^s and /3^s. Apart from numerical factors, Pi is there- 
fore the product of certain of the differences ai — jSi, . . . , and P2 the 
product of the others. But this is impossible since Pi is synmietric in 
ai, . . . , Om and synmietric in i3i, . . . , jSn. 



4. A Correct Conclusion to be Drawn from Any Method of Eli] 
tion. Since the determination of r by means of symmetric functions of 
the roots is excessively laborious unless m or n is very small, we shall later 
give other methods. But we shall not know, without a careful enquiry, 
whether or not such a new method introduces an extraneous factor. Each 

* In place of §§3, 4, the reader may use § 9. But this substitution should be made 
only if the briefest course is desired. 

** This is evident for the resultant ad ^ he in § 1. For, if it were the product of two 
linear functions, the one not involving a would necessarily be d (or a numerical constant 
times d) and similarly the other factor would then be a. 
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method leads in fact to a polynomial F(ao, . . . , 6n) with the property 
that every. set of solutions Oo, . . . , 6n of r = is a set of solutions of 
F =0. It then follows that r is a factor of F. 

For ejcample, if /?(/, g) — 0, 

/ s oox* + aix + 02 = 0, g s 6oX* + tix + 62 = 
have a common root x. Then 

hif '- otQ = (0062 — <iJ)q)x^ + (0162 — a2&i)a; = 0, 
— 60/ + oofi' = (0061 — ai6o)a; + 0064 — 02^0 = 0. 
Exclude for the moment the case 02 => 62 = 0. Then x 9^ and 

0062 —ajbo ajh •— 0261 
0061 — aifto flo&2 —0260 



(3) F s 



= 0. 



It is easily verified that F = also in the excluded case. Hence any set of solu- 
tions Oo, . . . , 62 of r = is a set of solutions of F = 0. We found r in Ex. 5 
above. It is seen to be identical with this F. 

To prove in general that r is a factor of F, set 

r = CbOo" + CiOo"-^ + • • • + Cn, 

where Co, . . . , Cn are polynomials in ai, . . . , 6„, while Co is not identi- 
cally zero (Ex. 6 above). Express also F as a polynomial in Qq and apply 
the greatest common divisor process to F and r. Suppose that r is not 
a factor of F. If * the degree of F in a© is ^n, we may write 

koF = qor + ri, kiV = qiTi + r2, hn = qir^ + rs, 

where 5^0, Q'l, 0^2, ^i, r^ may involve Oo, while fco, fci, fe, rs do not (for sim- 
plicity we assume that Tz is the first Ti not involving Oo). If rs were iden- 
tically zero, r2 (or a factor actually involving Oo) would be a factor of r, 
as shown by the last two of our three equations. Since r2 is of lower degree 
in Oo than r, this contradicts the irreducibility of r (§ 3). Hence there 
exist constants ai', . . . , 6n' such that 

r3(ai', . . . , 6n') j^ 0, C(,(ai', . . . , 6n') J^ 0. 

For a\ = a/, . . . , 6n = 6n', r becomes a polynomial in Oo with constant 
coefficients and hence (Ch. V) vanishes for some value Oq of a©. By 

* In the contrary case, we drop the first equation and set fi * F. 
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hypothesis, any set of solutions, as ao,ai, . . . , bn' of r = is a set of 
solutions of F = 0. Hence F(ao', . . . , bn) = 0. For these values 
oo', . . . , 6n' of Oo, . . . , bn, wc have ri = by the first of our three 
equations, then r2 = by the second, and ra = by the third. The last 
result contradicts rz{ai, • • • » bn) ^ 0. 

7/ any method of eliminating x between two equations in x leads to a relar- 
tion F = 0, where F is a polynomial in the coefficients, then F has as a factor 
the true resultant of the equations. 



5. Sylvester's Dialytic Method of Elimination. Let the equations 

OoX* + aix^ + a2X + as = 0, 60a:* + biX + 62 = 

have a common root x, so that their resultant r is zero. 

Multiply the first equation by x and the second by x? and x in turn. 
We now have five equations 

ooor* + a\X? + 02^2 + a«x =0, 

oox* + aix^ + 02^ + as = 0, 
hffl^ + 6ix» + &2X* =0, 

6oX» + 6ix2 + 6ax = 0, 

b{fl? + biX + &2 = 0, 

which are linear and homogeneous in x^, x*, x*, x, 1. Hence 

an ai O/i az Q 



(4) 



F= 



an ai Oi az 

bo bi bi 

bo bi bi 

bo bi bi 



= Ir 



is zero. By § 4, r is a factor of F. But the diagonal term Oo^V of F is a 
term of r (Ex. 6, p. 152). Hence F is the resultant. 

In general, if the equations are 



OoX^ + 



... 



+ cu = 0, 6oX'* + 



... 



+ 6n = 0, 



we multiply the first equation by x""^, x*~*, . . . , x, 1, in turn, and the sec- 
ond by af^\ X"*"*, . . . , X, 1, in turn. We obtain n + m equations which 
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are linear and homogeneous in the m + n quantities a:"^*~^, . . . , x,l, 
Hence the determinant 



(5) 



F= 



Oo CLl Ot - ' ' dm . . . . 

Oo ai 02 . . . Otn . 
Oo ai 02 . . . cu 





bo 6i . . 
6o 6i 



Oo 



Oi O2 
6n 0. 







. . 
. . 



• • 0|»» 







. . 



... bo 61 6, 



n rows 



m rows 



is zero. By § 4, r is a factor of F. But the diagonal tenn Oo^bn*" is a 
term of r. Hence F is the resultant. 



EXERCISES 

1. For m = n = 2, the resultant is 

Oo Oi 02 

Oo Oi 02 

bo hi bt 
bo bi bi 



Interchange the second and third rows, apply Laplace's development, and prove 
that 

r = (0062)' - (0061) (oiW, 

where (0062) denotes 0062 — 0260, etc. Compare with (3). 



2. For m = n = 3, show by interchanges of rows that 






Oo 


Oi 


02 


at 








bo 


61 


62 


6s 











Oo 


Oi 


02 


Ot 








bo 


61 


62 


6s 











Oo 


Oi 


02 


Ot 








bo 


61 


6« 


6« 
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Apply Laplace's development, selecting minors from the first two rows, and to the 
complementary minors apply a similar development. This may be done by 
inspection and the following value of — r be obtained: 



(ao6i)} (0162) (0263) 
-(ajb2)l(aj)2){aj>z) 



iaibzY+iajbzKiuMl 
{ck/)z){aibz)\ 



The third term of the first line and the first term of the last line are alike, 
changing the signs, 

r = (0063)' - 2 (0061) (0063) (0263) - (0062) (ao6j) (0163) 

+ {aJhY (0263) + (ao6i)(ai63)' - (0061) (ai6,) (0263). 

3. For m = n = 3, the method which led to (3) gives 

- &q/ + oofl' = (oofti) x^ + (ajbi) X + (0063), 
(63/ - azg)/x = (0063) x^ + (0163) X + (0261). 



Hence, 



By (3), the resultant of these two quadratic functions is 



F = 



(0063) (0061) 
(0263) (0063) 



(ajbz) (0061) 
(0163) (0062) 



(0163) (0062) 
(0263) (0063) 



This is, however, not the resultant r of the cubic functions /, g. To show that 
(ajh) is an extraneous factor, note that the terms of F not having this factor 
explicitly are 

(a</)i){a^z) \ (0060(0263) - (oo62)(oi63) j . 

The quantity in brackets equals — (oo63)(oi62), since 



0=1 



Oo Oi O2 O3 

60 61 62 63 

Oo Oi 02 O3 

60 61 62 63 



= (0060(0263) - (0062) (0163) + (Oo63)(Oi62). 



We now see that F = r - (0063), where r is given in Ex. 2. 

4. Verify that (0063) is an extraneous factor by showing that if x* — 1 = 0, 
X* - X = 0, then r = 0, (0063) 9^ 0. 

5. The resultant of L s ax + /St/ and L' 3 ax + /S'j/ is ft = a/J' — afi. The 
determinant of the coefficients of x', xt/, t/^ in L' = 0, LU = 0, L'* = is 



R' 



a» 


2 a/3 


0" 


aa 


a/S' + a'/S 


/S^' 


a'2 


2a'/S' 


/3'« 



i 61 RESULTANTS AND DISCRIMINANTS 157 

If /2 = there east values not both zero of x and y such that L = L' — and 
hence values of x^, xy, j/', not all zero, such that I? — 0, etc. Thus jB = implies 
R' = 0. Since R is irreducible, it is a factor of R', But if /2' = 0, we are not to 
infer hastily that the values of x', xy, y'^ obtained from the three equations linear in 
them are consistent (i.e., the product of the first and third equals the square of 
xy) and hence have no right to conclude that R' — implies jB = and thus that 
R' is a power of R (as done in some texts). 

If R' =^ 0, the three linear homogeneous equations whose coefficients are the 
elements in the three rows of the determinant R' have solutions not all zero, which 
may be designated x^yXy — Zf y^. Then the equations may be written in the form 

L* = 2 a/82, LU = (a/J' + a'0)z, L'» = 2 a'fi'z. 

Thus 

= {LUy - I?U^ = /2»2». 

If ft ^ 0, then 2 = 0, L = L' = 0, ftx = ftj/ = 0, whereas x, y, z are not all zero. 
Hence ft' = impUes ft = 0. Thus each irreducible factor of ft' is a numeric^ 
multiple of ft. By examining one term of ft', we see that ft' = ft*. 

6. The determinant of the coefficients of x', x*y, xy*, y* in 

L» = 0, L«L' = 0, LL'» = 0, L'» = 0, 

equals ft*. Prove as in Ex. 5 and also as in Ex. 7. 

7. Reduce the determinant ft' in Ex. 5 to the form R^. If /9 = 0, ft' is evidently 
ft'. If /S ^ 0, multiply the elements of the second column by — o//9, those of the 
third column by 0*7/3*, and add the products to the elements of the first column. 
The elements of the new first column are 0, 0, ft*//3*. Hence 






2a/3 /3 


ft* 


«/S fi 


a/S' + a'/3 /3' 


fi 


«'/s fi' 



= R^-R. 



8. If f or F = we omit one of the equations in § 5, we have a consistent set 
of equations which determine x in general. Thus if m = n = 2, x/ = 0, / = 0, 
(7 = give ao{aJ)i)x = — 00(0062). The latter is in agreement with the linear 
equation in the example, p. 153. 

6. Discriminants. Let ai, . , . , omhe the roots of 
(6) f{x) s aoX"» + aizr-^ + • • • + cu = (<k 7^ 0) 

As in Ch. Ill, § 3, we define the discriminant of (6) to be 

D = Oo^ "•'*(«! — a2y{ai — 03)* . . . (ai — a«)*(a2 — as)* . . . (a«-i — Om)*. 

Evidently D is unaltered by the interchange of any two roots. Since the 
degree in any root is 2 (m — 1), the symmetric function D equals a poly- 
nomial in oo, . . . , am- Indeed, Oo***"* is the lowest power of Oo sufficient 
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to cancel the denominators introduced by replacing Sai by — ai/o©, • . • , 
«!«! . . . a« by zfc cu/oo. Now * 

/'(ai) = Ooiai — a2){ai — az) . . . (ai — ««), 
/'(«2) = ao{ot2 — ai)(a2 — as) . . . ("2 — Om), 
/'(a3)= 00(03 — ai)(a3 — ot2){az — a^ ... (03 — Om), .... 

Hence 

m(m— 1) 

By (2), the left member is the resultant of /(x), f{x). Hence 

m(m— 1) 1 

(7) 0=(-l) » i«(/./'). 

Oo 

For another proof that D is a numerical multiple of R/oq, see Ex. 9 below. 



EXERCISES 

1. Show that the discriminant of f = y^-^py + q — Oia —4 p* — 27 g* by 
evaluating the determinant of order five for /2(/, /'). 

2. Find the relation between the discriminant of f(x) — and the resultant of 
tnfix) — xf' (^) and/'(x). 

3. Hence the discriminant of oox* + oix* + a«r + aiis — Jr, where 

r = (0102 — 9 ooas)* — (2 02* — 6 aiaj)(2 ai* — 6 OoOj)* 

is the resultant of aix^ + 2a«r + 3a3 = 0, 3 oox' + 2 OiX + oj = 0, by (3). 

4. The discriminant of the product of two functions equals the product of their 
discriminants multipUed by the square of their resultant. Hint: use the expres- 
sions in terms of the differences of the roots. 

5. Oo is a factor of R(fj /') by the first column of its determinant. 

6. For Oo = 1, the discriminant equals 



1 ai ai^ . . . ai 



m-l 



1 



at 02' 



. . at 



m-l 



1 



Om am 



Clm 



m-l 



So 
Si 



«1 8t 
Si 8z 



• • • Sm— 1 
... 8ifi 



5m— 1 Sm Sm+1 • • • 52 m— S 



where »» = oi' + • • • + am*- See Exs. 10, 11, p. 141. 

* By differentiating f(x) - ao(x — ai) . . . (j — om) or by the first part of § 5, 
Ch. VII. 
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7. Hence the discriminant of oc^ + px + q = equals 

3 ~2p 
-2p -3g 
-2p -35 2p2 



= -4p»-27g^ 



8. The discriminant D(ao, ..., Om) is irreducible. As in § 3, a factor would 
equal a product P of powers of the differences oi — 0/ such that P is symmetric 
in ai, . . . , Om' Thus every difference would be a factor. But the product 
of the first powers of all the differences is changed in sign by any interchange of 
two roots (Ch. XI, § 12). Hence P is divisible by the square of the last product. 

9. Prove that D is a constant times R(J,f) -^ oo by use of § 4. Since D = 
implies ft = 0, the irreducible D is a factor of R, But Z) is of total degree 2 m — 2 
in Oo; ai, . . . , and ft is of total degree 2 m ~ 1. Hence R/D is of the first degree 
and thus (Ex. 5) a numerical multiple of Oo. 

T.f Euler's Method of Elimination. Let / and g be given by (1). 
If /(x) = and ^(x) = have a common root c, then 

/(x) = (x - c) /i(x), g{x) s (x - c) sfi(x), 

identically in x, where /i(x) is a polynomial of degree m — 1, and ^i(x) is 
of degree n — 1. Hence 

f{^)gi(x)^g{x)fi{x), 

identically in x. Hence if Oo, . . . , 6n are any numbers for which ft(/, g) 
=0, there exist constants g^i, . . . , Qn, Pu . . . , Pm not all zero for which 

(oox^ + aix*"-^ + • • • + cu) (gix'*-^ + g2X''-2 + • • • +gn) 

■ (60X'* + biX"""^ + • • • + 6n)(PlX"*"^ + P2X"*-2+ • • • + Pm), 

identically in x. Equating the coefficients of like powers of x in the two 
products, we obtain the relations 

Oo^i — bopi = 0, 

ai(Zi + ^2 — hipi — 60P2 = 0, 

Om(?n-l + dfn-iqn — bnPm-1 — bn-lPm = 0, 

amQn — bnPm = 0. 

Since these m+n linear homogeneous equations in the unknowns (?i, . . . , 
Qny —Ph' . . f —Pm have a set of solutions not all zero, the determinant 
of the coefficients is zero. Interchanging the rows and columns of this 
determinant, we get (5). The proof that (5) is the resultant follows as in 
the last two lines of § 5. 
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S.f Bezout's Method of Elimination. When the two equations are 
of the same degree, the method will be clear from the example 

/ a oox^ + OiX* + O2X + a8 = 0, flf- boi? + bix^ + 6aX + &8 = 0. 

Then 

Ooflf — bofj 

(8) (oox + ai)g- {boX + 6i) /, 

equal respectively 

(a(j6i)x2 + {(uM X + (0063) = 0, 

(ao&2)x2+ { (0063) +{aib2)lx+{aih) = 0, 

(0063)2:* + {aibz)x + ((hbz) = 0, 

where (oobi) = Oobi — ai6o, etc. The determinant of the coeflScients is the 
negative of the resultant i?(/, fif). Indeed, it is divisible by K (§ 4) and 
has a term of —R. The negative of the determinant is seen to have the 
expansion given as r in Ex. 2, p. 156. 

The three equations used above are evident combinations of 

xV=0, x/=0, /=0, a:V = 0, xg = 0, ^ = 0, 



the latter being the equations used in Sylvester's method of elimination, 
determinant of the coefficients in these six equations is 

oo ai 02 as 



The 



R^ 



Oo Oi Os Oi 









do 


Oi 


(h 


a* 


&0 


61 


&> 


bt 











&o 


61 


62 


6. 











60 


61 


bt 


&> 



The operations carried out to obtain the above three quadratic equations are 
seen to be step for step the following operations on determinants. First, Oo'jB 
is derived from the determinant R by multiplying the elements of the last three 
rows by oq. To the elements of the new fourth row add the products of the ele- 
ments of the 1st, 2nd, 3rd, 5th, 6th rows by —60, -"61, — 6j, Oi, 02 respectively 
[corresponding to the formation of the third function (8)]. To the elements of 
the fifth row add the products of the elements of the 2nd, 3rd, 6th rows by —60, 61, 
oi respectively [corresponding to the second function (8)]. Finally, to the ele- 
ments of the sixth row add the products of the elements of the third row by —6© 
[corresponding to ooQ — 60/]. Hence 
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a<f R = 



Oo 


fli 


at 


(h 











Oo 


fli 


(h 


Ot 











Oo 


ai 


Ot 


Ot 











: (ao6,) 


(aJn) 


(a,6s) 











• (ao6,) 


(ao6,) + (ai60 


(aiW 











(ajbi) 


(0062) 


(006,) 



so that /? equals the 3-rowed mmor enclosed by the dots. The method of B^zout 
therefore suggests a definite process for the reduction of Sylvester's determinant 
of order 2 n (when m = n) to one of order n. 

Next, for equations of different degrees, consider the example 

/ = OoX* + aix* + a^ + OaX + 04, ^ = 6ox* + friX + &2. 
Then 

oox^flf - Ihf, (oox + ai)3i?g - (6ox + 61)/ 

equal respectively 

{ajbi) 3? + {ajth) ^ — aaboic — oA), 

(0062) x* + { (0162) — aabo} a:^ — {oabi + aA} a: — 0461. 

The determinant of the coeflScients of x*, x^, x, 1 in these and xg, g, after 
the first and second rows are interchanged, is the determinant of order 4 
enclosed by dots in the second determinant below. It is the resultant 
RiS, g) by § 4. 

As in the former example, we shall indicate the corresponding operations on 
Sylvester's determinant 



R = 



Oo 


ai 


02 


at 


ai 


u 





Oo 


ai 


at 


at 


04 


ho 


61 


ht 





. 








ho 


hi 


62 














ho 


hi 


62 














ho 


hi 


62 



Multiply the elements of the third and fourth rows by oo. In the resulting deter- 
minant oo^Rj add to the elements of the third row the products of the elements of the 
first, second and fourth rows by —60, —61, Oi respectively. Add to the elements 
of the fourth row the products of those of the second by — 6^. We get 
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ao»i2 = 



Oo 


-«i 


a% 


(h 


04 








Oo 


ai 


at 


Ot 


04 








(0062) 


(flibt) - ajbo 


— oA — 0160 


— aii>i 








(0061) 


(ajh) 


— a«6o 


—0460 








bo 


61 


bt 








: 





60 


bi 


6, 



Hence /{ equals the minor enclosed by dots. 

EXERCISES t 

1. For m =: 3, n = 2, apply to Sylvester's determinant R exactly the same 
operations as used in the last case in § 8 and obtain 

{OiM {ajbt) —aJbo -ajbi 

R= (ajbi) (ajh) — a«6o 

bo &i &2 

2. Hence show that the discriminant of 002:* + aix* + o^ + os =" is 

2 OoOs 0102 + 3 OoOs 2 OiOs 
Oi 2 Ot 3 Oi 

3 Oo 2 Oi 02 

= 18 O0O1O2O1 — 4 00O2' — 4 oi'oi + OiW — 27 OoW. 

3. For m =^ n = 4y reduce Sylvester's i2 (as in the first case in § 8) to 

(aJ>i) (ajh) (ao6j) (0064) 

(0063) {ajh) + (0162) (0064) + (oiW (0164) 
{aJh) (0064) + (0161) (0164) + (0261) (0264) 

(0064) (0164) (ajbi) (ajbi) 

4. For / and g of degree n, the tth function (8), when written as a determinant 
of the second order, is seen to equal 

diix""'^ + di^""-^ + • • • +d,„, 
where 

dij = (ao6t+^i) + (ai6,-|.,-2) + • • • + (o»-i6,). 
Then 



n(n-l) 



du ... din 



flnl • • . (*nn 

This D is called the B^zout determinant of / and g. Show that da » d^. 
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5. Hence verify f or m = n = 5 that R can be derived from 

(ajbi) (0062) (ao6j) (ajbi) (0065) 

{ajbi) (oofci) (aJbi) (aJbi) (aihi) 

{aJh) (0064) (0065) (0165) (0265) 

(aJbi) (0065) (0165) (0465) (0165) 

(0065) (0165) (0265) (0365) (0465) 

by adding to its nine central elements the elements of 

(0162) (aihz) (aJbi) 

(aJh) (aihi) + (chbz) (0^)4) 
(aihi) (ajbi) (0364) 

6. If R{ff g) = 0, we obtain a consistent set of equations by omitting one of 
B^zout's equations. Hence they determine x. If m = n = 2, find x. If m = n 
= 3, find X, 

7. If m = n, set giix) = x^^-^gix). Then 

8. If m = n, R{cf +dg,8f + tg)= ±{ct- d8)'^R{f, g). [Find the new (0^6,).] 

9. Express as a determinant of order m the resultant of f{x) = and x"* = 1. 
[Multiply/ by x and reduce by x"* = 1; repeat.] 

9.t Without employing the results of §§3, 4, we may give a direct 
proof that the determinant (5) is the resultant of / and g, given by (1). 
While the method is general, we shall present it only in the case m = 3, 
n = 2. In the equation 



(9) 



Oo 


ai 


Ot 


az — z 








Oo 


Oi 


Ol 


a» 


bo 


61 


h 











bo 


bi 


h . 











bo 


61 


bi 



— z 



= 0, 



take z = fiPi)- Multiply the elements of the first four colunms by 
Pi^, Pi^y Pt^, Pi, respectively, and add the products to the last column. 
All of the elements of the new last colunm are zero. Hence /(j8i) and 
/O32) are the roots of (9). Since the equation is of the form 

6oV + ( )2 + F = 0, 
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where F is given by (4), we have 

F= 6o»/08i)/(ft). 

Hence the Sylvester determinant F is the resultant B(J, g). 
Moreover, the equation in z is the eliminant of 

g{x) = 0, z= fix), 

and hence gives explicitly the equation obtained from g{x) = by apply- 
ing the transformation z = fix) of Tschimhausen (Ch. VII, § 13). 

10. t Theorem. Necessary and sufficient conditions that f{x) and g(x) 
shall have a common divisor of degree d, hut rume of higher degree, are 72 = 0, 
fli = 0, . . . , Rd-i = 0, Rd 9^ 0, where R is the determinant (5), and Rk is 
the determinant derived from R by deleting the last k rows of a*s, the last k 
rows of Vs, and the last 2 k columns. 

For example, if m = n =» 4, 



(10) 



i2i = 



Oo 


ai 


at 


Ot 


04 








Oo 


ai 


02 


Os 


04 








Oo 


ai 


02 


Oi 


&o 


6i 


Ih 


ht 


64 








ho 


6i 


h 


6, 


64 








ho 


hi 


ht 


6, 



To prove the theorem for the case d = 1, set 

/i = pi£^'^ + • • • + Pm-h gi = Ql^"""* + 



• • • 



The conditions for an identity of the form 
(11) fgi-gfi^cx + c' 



+ 3n-l. 



are 



flog. 



hpi 

hiPi — 60P2 



= 0, 
= 0, 



arnQn-l 



— bnPm-2 — bn-lPm-l = C, 
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Omitting the last equation, we have m + n — 2 linear equations for the 
same number of unknowns g,-, — ^»-. The determinant of the coefficients 
equals Ri with the rows and colunms interchanged. Hence ii Ru^ 
we may choose c = Ri and find values not all zero of the unknowns satis- 
fying all of the above equations except the last, and then choose c' so that 
the last holds. Let R = 0. Then / and g have a conmion linear factor, 
but no common factor of degree > 1 since the right member of (11) is of 
degree unity. 

But if B = 12i = 0, we may take c = and find values not all zero of 
Qi, Pi satisfying all but the last of the above equations. The resulting 
value of c' is zero by (11), with c = 0, since/ and g have a common factor 
X ^ r. Then 



X — r X — r' 

Since not all of the m — I linear factors of the first fraction are factors 
of /i (of degree m — 2), at least one is a factor of the second fraction. 
Hence if B = Bi = 0, / and g have a common factor of degree > 1. 

To prove the theorem for d = 2, we employ functions fz and g2 of de- 
grees m — 3 and n — 3, respectively. Of the conditions for the identity 

(12) fg2 -gh = ca? + c'x + c", 

we omit the two in which c' and c" occur and see that the determinant 
of the coefficients of the remaining equations is R2. Then if 

B = Bi = 0, 222 5"^ 0, 

we may take c = K2 and satisfy all of the conditions for (12). Thus 
/ and g have no conmion factor of degree > 2. 



EXERCISES t 

1. By performing on (10) exactly the same operations as used in § 8 to reduce 
a determinant of order 6 to one of oi:der 3, show that 

(ao6j) (0064) + (ai6j) (0164) + (0263) 
Ri = (0062) (0063) + (0162) (0064) + (aih%) 
(oofti) (0062) (ao6j) 

Note that if 04 = 64 = 0, the present work reduces to the former. 
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2. In the notation of Ex. 4, p. 162, the preceding Ri with its first and third rows 
interchanged becomes Dii 



A = 



dii dit diz 
da dn dn 
dn di2 dtt 



Ri = -A. 



3. Form = n, 



Oil . . • di n-k 
dn-*l . . . dn~kn-k 



4. Hence, if m = n, / and g have a common divisor of degree d, but none of 
degree > d, if and only if D = 0, A =0, . . . , 2>d-i = 0, 2>d ?^ 0. 

5. Give a direct proof of Ex. 4 by multiplying the ith function in Ex. 4, p. 162, 
by a variable yi and summing for i = 1, . . . , ^ Thus 

fl"{ao2/i+(aoX + ai)2/2+ • • • +(aox'~*+ • • • )yt j-/'{iwi+(^oX + 6i)ya+ • • •} 
where «i == dnj/i + • • • + dn2/t, . . . , «n= dml/i + • • • + d«ny«. 



The determinant of the coefficients of yi, . , , , yt in di, . . . , d( is Dn~t. If 
D = 0, taket = n; then we can choose j/i, . . . , ^n not all zero so that dis=0,. . . , 
a„ = 0. Then gfi — fgi ^ for functions /i and ^i of degree n — 1, so that / 
and g have a linear divisor. If also Di = 0, take t = n ^ 1; then we can make 
«i = 0, . . . , «n-i = 0. Hence gfz — /fl'2 ■ «n for functions /2 and ^j of degree 
n ~ 2. Since / and g have a common divisor, the constant 8n is zero, and hence 
they have a common divisor of degree ^ 2. But if Di ?^ 0, we can make 

gfi - fgt s Sn^lX + in, in-l^ 0, 

SO that the only common divisor is linear. 
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MISCELLANEOUS EXERCISES 

1. Find a necessary and sufficient condition that the roots a, /9, y of 
x^ + px* -^ qx + r = shall be in geometrical progression. 

2. For the same equation find Xo^0^, [Replace x by l/x.] 

3. Find the equation with the roots ct^ + fi\ a^ + y\ 0^ + 7*. 

4. Find the equation with the roots c^ + ^ — y^, c^ + y^ — 0^^ etc. 

5. Find the equation with the roots c? + a^ + 0^, etc. 

6. Solve the equation in Ex. 1 by forming and solving the quadratic equation 
with the roots (a + w/9 + w^t)' and (a + w*/3 + wt)', where w* + w + 1 = 0. 
(Lagrange.) 

7. Solve x* — 28 X + 48 = 0, given that two roots differ by 2. 

8. Find a necessary and sufficient condition that 

f{x) = x* + px* + qx^ + rx + 8 = 

shall have one root the negative of another root. When this condition is satisfied, 
what are the quadratic factors of /(x)? 

9. Solve /(x) sx* — fix' + lSx* — 14x + 6 = 0, given that two roots a 
and /3 are such that 2a + fi = 5. Hint: /(x) and /(5 — 2x) have a common 
factor. 

10. Diminish the roots of x* + qx^ + rx + 8 = (s 9^ 0) by such a number 
that the roots of the transformed equation shall be of the form a, m/aj 6, m/6, and 
show how the latter equation may be solved. 

11. Solve X* - 2 X* - 16 X + 1 = by the method of Ex. 10. 

12. By use of the equation whose roots are the] squares of the roots of 
x* + x' — x2 + 2x — 3 = and Descartes' rule, show that the latter equation 
has four imaginary roots. 

13. Similarly, x' + x2 + 8x + 6 = has imaginary roots. 

14. If all of the roots of x** + ax**~^+ ftx'*^ + • • • = are real, 

a»-26>0, 62-2ac + 2d>0, c» ~ 2W + 2a6 - 2/> 0, . . . . 

Hint : Form the equation in y = x*. 

15. Solve X* + px + g = by eliminating x between it and x^ + vx + w — y 
by the greatest common divisor process, and choosing v and w so that in the result- 
ing cubic equation for y the coefficients of y and y^ are zero. The next to the last 
step of the elimination gives x as a rational function of y. (Tschimhausen, Acta 
ErudiL, Lipsiae, II, 1683, p. 204.) 

16. Find the preceding y-cuh\c as follows. Multiply x^ + vx + w ^ y by x 
and replace r* by — px — g; then multiply the resulting quadratic equation in x 
by X and replace x* by its value. The determinant of the coefficients of x*, x, 1 
must vanish. 

17. Eliminate y between y* = v, x = ry + sy^, and get 

x» - 3 rsux -(rh) + «V) = 0. 

Take 8=1 and choose r and v so that this equation shall be identical with 3^ + px 
+ q = 0, and hence solve the latter. (Euler, 1764.) 
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18. Eliminate y between y* = y, a; =/4-ci/ + y*andget 

1 e f — x 



e f — X V 
f — X V ev 



0. 



This cubic equation in x may be identified with the general cubic equation by choice 
of c, /, V. Hence solve the latter. 

19. Determine r, s and v so that the resultant of 

X +r 

shall be identical with x* + px + g = 0. (B^zout, 1762.) 

20. Show that the reduction of a cubic equation in x to the form 2/* = by the 
substitution 

r + sy 



X = 



l + y 



is not essentially different from the method of Ex. 18. [Multiply the numerator 
and denominator of x by I — y + y^.] 

21. If the discriminant of a cubic equation is positive, the number of positive 
roots equals the number of variations of signs of the coefficients. 

22. Descartes' rule gives the exact number of positive roots only when all the 
coefficients are of like sign or when 

/(x)= X** + PiX"-^ + • • • + Pn-*X* - Pn-,+lX"-^- ' ' ' — Pn = 0, 

each Pi being ^ 0. Without using that rule, show that the latter equation has one 
and only one positive root r. Hints: There is a positive root r by Ch. I, S 12 
(o = 0, 6 = Qo). Call P(x) the quotient of the simi of the positive terms by x*, 
and call —Nix) that of the negative terms. Then N(x) is a sum of powers of 
1/x with positive coefficients. 

If x>r, P(x)>P(r), Nix)<Nir), /(x)>0; 

If X < r, P(x) < P(x), N{x) > N(r), /(x) < 0. (Lagrange.) 

23. If /(x)= /i(x) + • • • +/ik(x), where each /,(x) is like the/ in Ex. 22, and 
if /? is the greatest of the single positive roots of /i = 0, . . . , A = 0, then R is 
an upper limit to the positive roots of / = 0. 

24. Any cubic or quartic equation in x can be transformed into a reciprocal 
equation by a substitution x = nj + s. 

25. Admitting that an equation /(x) s x'* + • • • =0 with real coefficients 
has n roots, show algebraically that there is a real root between a and 6 if 
/(a) and /(6) have opposite signs. Note that a pair of conjugate imaginary roots 
c ±di are the roots of (x — c)* + ci^ = and that this quadratic function is 
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positive if a; is real. Hence if xi, . . . , Xr are the real roots and 0(x) =■ (x — Xi) 
. . , {x — Xr), then 0(a) and 0(6) have opposite signs. Thus a — a;,- and b — x< 
have opposite signs for at least one real root x,-. (Lagrange.) 

26. If 8j is the sum of the jth powers of the roots of an equation of degree n 
and if m is any integer, the equation is 



•n 



•n— 1 






. X 1 

' 8m+2 Sm+l 



Sm+2n— 1 5in+2n— 8 • 



Sm+» fim+n— 1 



= 0. 



Hint: Use the second set of Newton's identities. (Jacobi.) 
27. If a < 6 < c . . . <l, and a, /3, . . . , X are positive, 



+ 







+ 



X — a X — b X — c 



+ 



. • • 



+ 



x-l 



+ « = 



has a real root between a and 6, one between b and c, . . . , one between A; and ly 
and if / is negative one greater than Z, but if ^ is positive one less than a. 

28. Verify that the equation in Ex. 27 haa no imaginary root by substituting 
r + 81 and r — si in turn for x, and subtracting the results. 

29. In the problem of three astronomical bodies occurs the equation 

r^+ (3 - /x)r*+ (3 - 2//)/^ - ^r* - 2Mr ~ m = 0, 

where < m < 1. Why is there a single positive real root? As m approaches 
zero, two complex roots and the real root approach zero. 

30. Discuss the equation obtained from the preceding by changing the signs of 
the coefficients of r* and r. 

31. By Newton's identities. 



«s= - 



«*= - 



1 





Pi 




Pi 


1 


2p2 


= -Pi» + 3p 


Pa 


Pi 


3p3 




1 








... pi 


Pi 


1 





... 2p2 


Pi 


Pi 


1 


... 3p8 


Pi 


Pj 


Pi 

• • 


... 4p4 



P*-l Pk-2 P*-» ... Pi ^Pk 

where all but the last term in the main diagonal is 1, and all terms above the diagonal 
are zero except those in the last column. If A; > n, we must set py = 0{j > n). 
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32. By Newton's identities, 



3Ips=- 



l 8i 
8i 2 82 
5j Si 8z 



, A;! pifc = - 



if A; = n. Butif A; = n, 

8k fi*-l Sjfc-2 



1 


. 


. . 


«1 


8i 2 


. 


. . 


^ 


«a «i 


3 . 


. . 


Si 


«*-l 8t_j 


8k-t 


t m • K 


8k 



8k+l 8k . 



fijfc-1 



• • • 



8k-n 
«A— n+1 



= 0. 



8k+n fifc+n-l S*+n-2 . . . «* 

33. Let «< = ofi* + • • • + ttn*. Let ai', ...,«„* be the roots of 

yn + Pjyn-l+ . . . +P^ = 0. 

Sety=ay*andmultiply theresultbya/*"***, whereA;^2n. Sumforj—l, . . . ,n. 
Thus 

«ik + Pl8k-2 + Pi8k-i + • • • + Pn8k-in = 0. 

Hence 

8k «*-2 8k-i . . . fiifc-Jn 



fiib+l SA-1 



«*-« 



«*-J»H.i 



= 0. 



35. 



«o Si «j 
8i ^2 8z 
8i 8i «4 



«*+n «A+n-2 fiifc+n-4 . . . «*-n 

34. Obtain a vanishing detenninant similar to that in Ex. 33 but having the 
subscripts of the s's in each row decreased by 3. 

10 

8o 8i 8i 
«0 «1 8i 8i 
8i 82 8t 84 

P2 Pj 

81 + Pi 80 8i + pi5i + P180 

81 + Pi«o 82 + P181 + P180 «8 + Pl«2 + PtSl + P180 

82 + P181 Si + P182 + P281 8i + pi«8 + P2«2 + P981 

Pi P2 P% 

n (n - l)pi (n - 2)p2 

(n - l)pi (n - 2)p2 (n - 3)ps 
2p2 3pi 4p4 



1 


So 

Si 

1 


n 

Pi 



Pi 
So 
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36. If n = 3, the last detenninant may be obtained from the Sylvester resultant 
R oi a^ + jtix^ + PaX + pi and its derivative by multiplying the elements of the 
first row oi Rhy —3 and adding the products to the elements of the third row. 

37. Express the determinant of order 4 in the 8i (analogous to the first one in 
Ex. 35) as a determinant of order 6 in the p's. For n = 4, identify the latter with 
the resultant of x* + pix* + paX* + V^ + P4 and its derivative. 

38. Let Sk be the sum of the Arth powers of the roots Xi, . . . , Xn of a given 
equation. The coefficients of the equation having as its roots the J n{n — 1) 
squares of the differences of the x*a can be found from iSi, 1S2, . . . , where Sp is 
the sum of the pth powers of the roots of the latter equation. Expand by the 
binomial theorem 

(x - ziy^ +(z- XiY^ + • • • + (x - XnY^, 
set X = Xi, . . . , X = Xn in turn, add and divide by 2. Thus 

Q «. 9^. . I (2P)(2P-1) ^ ^ 



2p(2p-l) . . ■ (p + 1) ., 
1-2 






(Lagrange.) 

39. In particular, Si — riBi — Si', iSj = njj4 — 4 «i«8 + 3 «2*, 

& = nse — 6 fii«6 + 15 8^i — 10 Ss*. Hence give the equation whose roots are 
the squares of the differences of the roots of a given cubic equation. Deduce the 
discriminant of the latter. 

40. The equation whose roots are the n{n — 1) differences x, — Xk of the roots 
of /(x) = may be obtained by eliminating x between the latter and /(x + y) = Q 
and deleting the factor y*^ (arising from y = Xj- — xy = 0) from the eliminant. 
The equation free of this factor may be obtained by eliminating x between /(x) = 
and 

l/(x + y)-/(x)|/2/=/'(x)+r(x)-^+ . . . +fn(x)—j!^ =0. 

1' Z 1 '2 ... n 

This eliminant involves only even powers of y, so that if we set y* = 2 we obtain 
an equation in z having as its roots the squares of the differences of the roots of 
fix) = 0. (Lagrange.) 

41. Compute by Ex. 40 the 2;-equation when /(x) = x* + px + g. 

42. Except for 6 = 0, the equation 



a — X b 
b f-x 



= 0, 



lias a real root exceeding a and /, and one less than a and /. [Substitute a and / 
for X in turn]. 
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43. Let the equation in Ex. 42 have distinct real roots a, fi, where a > 0. Then 
there are three real roots of* 

a — z b c 

b f — x g 

h "X 



D{x)^ 



= 0. 



f-x 

c g 

Hint: The results of substituting a and /9 f or x in D{x) are 

[c ViT^ + g Va - a]\ - [c V/-/3- ^ Va - /sf , 

where the product of the radicals in each is +&. Hence if neither a nor /9 is a 
root, there is a root > a, one < /3, and one between a and /9. If a is a root, 
there is a root < fi and hence three real roots. 

44. If a = /3 in Ex. 43, then a = / is a root of D{x) = and there are two 
further real roots. 

45. 



aa' + bb' + cc' 
ae' + bf + eg' 


eo' 
ee' 


+ fb' + gc' 
+ff' + 99' 


= aS 


c 
i 


I'b' 

'7' 


+ ag 


a 
e 


'c' 

'g' 


+ be 


b' a' 
r e' 


+ bg 


b' c' 
f 9' 


+ ce 


c' 


1 
f 


+ cf 


c' h 
9' J 




• 



Combine the first and third, second and fifth, fourth and sixth: 



a b 




a' 6' 




a c 




a c 




6 c 1 


b' c' 




• 




+ 




• 




+ 


1 • 




e f 




e' r 




e g 




e' 9' 




fg I 


f'g' 





a b 


1 


a c 


t 


b c 


=: 




+ 




+ 






e f 




e g 




i 9 



46. Hence, in particular, 

a* + 6* + c* ae + bf+cg 
ae + bf+cg e'+P+g^ 

47. Hence if a, 6, c and e, /, g are the direction cosines of two lines in space, and 
if ^ is the angle between them, so that coa $ = ae + bf + cg^ then sin* $ equals the 
above sum of three squares. 

48. For the determinant in Ex. 43, 



D{x)'D(-'x) = 



a* + 6^ + c* — j' ab + bf+cg ac + bg + ch 

ab + bf + cg b' + P + f - x^ bc+fg + gh 

ac + bg + ch bc + fg + gh c* + ^ + A' — x* 

= -x* + x*{a^ +P + h^ + 2b^ + 2c^ + 2g^)- x\D, + D2 + Z),) + />»(0), 



* This theorem is imF)ortant in many branches of pure and applied mathematics. 
Besides this proof and that in Ex. 48, other more advanced proofs, including that by 
Borchardt, are given in Salmon's Modem Higher Algebraj pp. 48-56. 
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where Da is the first detenmnant in Ex. 46 f or e = 6 and Di and Ds are analogous 
minors of elements in the main diagonal of the present determinant of order 3 
with X = 0. Hence the coefficient of — x* is a simi of squares (Ex. 49). Since 
the function of degree 6 is not zero for a negative value of x^, D{x) = has no 
purely imaginary root. If it had an imaginary root r + ^h then D{x + r) = 
would have a purely imaginary root si. But D{x + r) laoi the form in Ex. 43 with 
a, /, h replaced by a — r^ } — r, h — r. Hence D{x) = has only real roots. 
The method is applicable to such determinants of order n. (Sylvester.) 

49. In Ex. 48, Di-\- D2 + Dz equals 

(a/ - b^y -hiah- c^y +(fh- g^Y + 2 (a^ - 6c)2 + 2 (c/ - bgY + 2{bh- cg)\ 

50. Without using its solution by radicals, prove that 

x^ + hT^ + cx'^ + dx + e 

has a factor x^ — 8X + 'Py where « is a root of a sextic equation, and that p is a 
rational function of « and the coefficients. 

Hints: There are six functions like « = Xi + X2; next, 

C = 2X1X2 = «(X8 + X^ +p-\- X8X4, 

—d — 2x1X2X3 = 8X9X4 + (xi + x\)p. 

Replace X3 + 2:4 by — 6 — « and solve for p the resulting linear equations in 
Xjr4 and p. The case 6 + 2 « = may be avoided by starting with another pair 
of roots. 

51. Prove Ex. 50 by dividing the quartic by the quadratic function and requir- 
ing that the linear remainder shall be zero identically. 

52. Prove Ex. 50 by use of (3) and (8) in Ch. IV. 

53. jf + bx^ '\- cx^ + dj? + ex"^ + }x + g has a factor x^ — 5x + P, where « is a 
root of an equation of degree 15, and p is a rational function of 8 and the coefficients. 
Hints: Write 

<r\ = X3 -h X4 + Xs + ^6, <r2 = ^3X4 + • • • , <n~ X3X4X5 + • • • , <rA — X3X4X6Xe 

for the elementary symmetric functions of Xs, . . . , xe, and show that 

— 6 = S + <ri, C = p + 8<ri + <T2f — d = P<ri + *r2 + <r8, 

e = p<r2 + &r3 + 04, — / = ^A + P<r8, g = P<r4. 

The first four relations determine the as. Then the last two give a cubic and a 
quadratic equation in p, by means of which we may express p as a rational function 
of s and then obtain an equation in « alone. Why must this be of degree 15? 

54. If Ex. 53 were solved as in Ex. 51 (if the quotient of x* + • • • by x* + • • • 
be denoted by x^ — aiJ^ + <yaaJ* — <r3X + <r4, we obtain the above six relations), 
why could we conclude that any equation of degree six with real coefficients 
has two complex roots (independently of the fundamental theorem of algebra)? 
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55. 



3 ai + at + az 

ai + a2 + Oi ai*+ a2'+ ctjf 



= S (ai - a,)'. [See Ex. 46.] 

3 



56. The detenuinant in Ex. 55 equals 



2 



at 



ai a2' 



=i:^ 
^i 



012 



ai 012 



+ 



1 ai 



aj «i 



-1 



2 ai + a2 

ai + as «!*+ 012^ 



=2 



1 at 
1 a. 



57. For n roots, n = 3, 



I> 
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«l 


«2 




Si 


82 


Si 


= 2 


S2 


«S 


8i 





«/ 



= ^ ttd 



«*' 



«*- 



«ik' 



A,i, fc = 1, . . . , n\ 

\i, jj k distinct. / 



Add the six detenninants given by the permutations of fixed i, j, k. Then 

1 +1 +1 ai + ai + ak af + aj^ + ai? 
«• + ay + oik oii^+ oij^+ ait* «»* + oij^ + ak* 



D. r 



i<j<k 



- X 



i<j<k 



1 



a» 



aik 



a*' 



1 «! «! 



ttik aik' 



= 2(ai — «;)*(«,- — a*)*(ay — a*)*. 
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«8 
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58. Comparing the theorems in Exs. 55 and 57 and their extensions with Ex. 12, 
p. 102, we see the nature of a proof of Borchardt's Theorem: An equation of degree 
n with real coefficients and distinct roots has as many pairs of imaginary roots as 
there are changes in signs in the series 



So = n, 



Sn— 1 Sn • • . Sfn— 2 

If two consecutive terms are zero, the theorem may fail, as x* + 1 = shows. 
But it holds if an isolated zero occurs and is suppressed. 

59. Denote the last series by Di = «o, ^^2, Ds, . . . , Dn. There are exactly 
r distinct roots of the given equation of degree n if and only if Dr is the last non- 
vanishing determinant of this series. For, as in Exs. 55-57, Dk is the sum of the 
various products of the squares of the differences of A: of the roots ai, . . . , a». 
If A; > r, each product involves two equal as and hence Dit =* 0. If A; « r, the 
only term not zero is that involving the r distinct a a, so that Dr ^ 0. (L. Baur, 
Math, Annalerif vols. 50, 52.) 
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60. The n roots are all real and diAtinct if and only if Dt, . 
positive. (Weber, Algebra, 2d ed., I, p. 322.) 

61. If each c,- is real and if the numbers 


.. y Z)n are all 


Co, 


Co Ci 
Ci Cj 


y . . . y 


Co 

Ci 


Cl ... Cn 
Ci ... Cn+1 




are positive, all of the roots of 


Cn 


Cn+l- • • C2n-f2 




Co + CiX + CjX* .+ • • ' 


' + 


Cna:''* = 





are imaginaryy and all but one of the roots of 

Co + CiX + C2a;* + • • • + Cjn+ia:*'*"'"^ = 



are imaginary. (Van Vleck, Annals of Math., 4 (1903), p. 191.) 



62. The results in Ex. 61 follow if the C2i and 



C2»+l Cii+2 



are all positive. 



(Kellogg, Annals of Math., 9 (1907), p. 97.) 

63. If the terms with negative coefficients in an equation of degree n are — ox 
— /Jx'*"*, — -yx""* , . . . y no positive root exceeds the simi of the two largest of 
the nimibers 

v^, V^, v^7, .... (Lagrange.) 

64. In Ex. 63, no positive root exceeds the greatest of the nimibers 



(Cauchy.) 



where k is the number of the negative coefficients —a, .... 

65.* Define Fa as in Ch. IX, § 8, and let /(a) 9^ 0, /(6) ?^ 0. If /(x) = has 
imaginary roots, Va — Vb cannot give the exact nmnber of real roots in every 
interval [a, 6] ; but, if /(x) = has no imaginary roots, Va — Vb gives the exact 
number of real roots in every interval [a, 6]. Hint: Use (14), Ch. IX. 

66. Budan's Theorem gives the exact number of real roots of fix) = in 
[a, 6] if /(a) 7^ 0,/(6) 9^ 0, provided that, for r = 0, 1, . . . , n — 2, real roots of 
^(^(x) = separate those of /('^^)(x) = in that interval from each other and from 
a and 6. The term "separate'* here excludes the case of coincidence. Hint: At 
a root of f^'^'^^^ix) = 0, the functions f^^K^) and /(*""'"2>(x) must be of opposite sign. 

67. Descartes' Rule gives the exact number of real roots only when Budan's 
Rule 13 exact for every positive interval [a, 6]. Thus it is exact for an equation 
having only real roots. 

68. We define as generalized Sturm's functions for an interval [a, 6] a sequence 
of polynomials /(x), /i(x), . . . , /r(x), with the following properties: 

* The author is indebted to Professor D. R. Curtiss for Exs. 65-72. 
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(a) No two consecutive functions vanish simultaneously at any point of [a, 6]; 

(b) fr(x) does not vanish in [o, 6]; 

(c) When, for 1 = i = r — 1, fi(x) vanishes for a value of Xi in [a, 6], /i-i(xi) and 
fi+i(xi) have opposite signs; 

(d) When f(x) vanishes for a value Xi in [a, 6], /i(xi) has the same sign as f(xi). 
Prove that the number of real roots of /(x) = in [a, 6] is equal to the difference 

between the nimibers of variations of signs in such a sequence f or x = a and for 
X = 6. 
Prove the corresponding statement for an interval [c, d\ within [a, 6], 

69. Prove that generalized Sturm's functions for any interval [a, 6], where 
a and 6 are both positive or both negative and /(x) = has no multiple roots, may 
be obtained as follows: Take /i(x) = /'(x). Arrange /(x) and /i(x) in ascending 
powers of x, and divide the former by the latter (using negative powers of x in the 
quotient, if necessary) ; let the last remainder of degree equal to that of /(x) be 
designated by r2(x); then /2(x) = — r2(x) -r- x*. Define /i(x) similarly by division 
of /i-2(x) by /t_i(x), both being arranged according to ascending powers of x; 
the last remainder of degree equal to that of fi-iix) is divided by — x* and the 
quotient taken as /i(x). Show that the sequence thus obtained is valid for 
[—00, ooj, provided no one of the functions vanishes for x = 0. 

70. Ptove that generalized Sturm's functions for any interval [a, 6], where 
a and b are both positive or both negative and /(x) = has no multiple roots, may be 

obtained by the greatest common divisor process for /(x) a aox'*+aiX'*"^+ • • • +(U 
and /i(x), with the signs of the remainders changed (as in Sturm's method), if we 
take 

/i(x) = 4>(x) s aix*-! + 2(hX'"-^ + ' ' ' +nan (« < 0), 

but /i(x) = —4>(x) if X > 0. Hint: x/'(x) + ^(x) = vf(x), 

71. Prove the analogue of Ex. 69 when /i(x) is taken as in Ex. 70. 

72. For the cubic /(x) s oox^ + Oi«* + Oax + oi without multiple roots, discuss 
the validity of the sequences in Exs. 69-71 for any interval [a, 61, where a < 0, 
6 > 0. Hint : If as 5^ 0, discuss whether variations of signs for x very near zero 
and negative = variations of signs for x very near zero and positive. 
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